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METHODS FOR DIAGNOSING AND TREATING DISEASES AND CONDITIONS 
5 ASSOCIATED WITH PROTEIN KINASE C X 

Field of the Invention 
This invention relates to methods for diagnosing and treating diseases and 
conditions associated with Protein Kinase C X. 

10 

Background of the Invention 
The processes by which organs acquire global structures and patterns during 
development are highly complex, and likely involve multiple, overlapping biochemical 
pathways. In the vertebrate heart, for example, the first key visible step in this process is 
15 chamber morphogenesis, involving the fasMoning of the atrium and the ventricle. 
Proper orientation of these two functionally distinct contractile units is required for 
unidirectional blood flow, which begins with the first heartbeat of an organism. Properly 
formed chambers thereafter are the substrates upon which further heart development is 
superimposed. 

20 Over recent years, much has been learned about the molecular mechanisms that 

are responsible for the acquisition of characteristic atrial and ventricular cell fates 
(Fishman et al., Development 124:2099-2117, 1997; Srivastava et al., Nature 407:221- 
226, 2000). However, both embryological and molecular steps that fashion the higher 
order structures of these chambers have proven to be more elusive because, in part, 

25 unlike cell fate decisions, these steps can be studied meanhigfully only in living 

organisms. The zebrafish, Danio rerio, is a convenient organism to use in genetic and 
biochemical analyses of development. It has an accessible and transparent embryo, 
allowing direct observation of organ function from the earliest stages of development, 
has a short generation time, and is fecund. 



Summary of the Invention 
The invention provides diagnostic, drug screening, and therapeutic methods that 
are based on the observation that a mutation, designated the "heart and soul (has)" 



mutation, in the zebrafish Protein Kinase C % (PKCX) gene, as well as a small molecule 
identified in a chemical screen in zebrafish, concentramide, cause abnormal heart growth 
and development. 

In a first aspect, the invention provides a method of detemiining whether a test 

5 subject (e.g., a mammal, such as a human) has or is at risk of developing a disease or 
condition related to PKCX (e.g., a disease or condition of the heart; also see below). This 
method involves analyzing a nucleic acid molecule of a sample from the test subject to 
determine whether the test subject has a mutation (e.g., the has mutation; see below) in a 
gene encoding PKCX The presence of such a mutation indicates that the test subject has 

10 or is at risk of developing a disease related to FKCk. This method can also involve the 
step of using nucleic acid molecule primers specific for a gene encoding PKQ, for 
nucleic acid molecule amplification of the gene by the polymerase chain reaction. It can 
further involve sequencing a nucleic acid molecule encoding ?KCk from a test subject. 
In a second aspect, the invention provides a method for identifying compounds 

15 that can be used to treat or prevent a disease or condition associated with PKCX, or in the 
preparation of a medicament for use in such methods. This method involves contacting 
an organism (e.g., a zebrafish) having a mutation in a PKCX, gene (e.g., the heart and 
soul mutation), and having a phenotype characteristic of such a disease or condition, 
with the compound, and determining the effect of the compound on the phenotype. 

20 Detection of an improvement in the phenotype indicates the identification of a 

compound that can be used to treat or prevent the disease or condition. In a variation of 
this method, the organism, with or without a mutation in the PKCX, gene (e.g., the has 
mutation), is contacted with a candidate compound in the presence of concentramide. 
In a third aspect, the invention provides a method of treating or preventing a 

25 disease or condition related to PKCX, in a patient (e.g., a patient having a mutation (e.g., 
the heart and soul mutation) in a PKCA, gene), involving administering to the patient a 
compound identified using the method described above. Also included in the invention 
is the use of such compounds in the treatment or prevention of such diseases or 
conditions, as well as the use of these compounds in the preparation of medicaments for 

30 such treatment or prevention. 

In a fourth aspect, the invention provides an additional method of treating or 
preventing a disease or condition related to PKCX in a patient. This method involves 
adnnmstering to the patient a functional PKd protein or a nucleic acid molecule (in, 
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e.g., an expression vector) encoding the protein. Also included in the invention is the 
use of such proteins or nucleic acid molecules in the treatment or prevention of such 
diseases or conditions, as well as the use of these proteins or nucleic acid molecules in 
the preparation of medicaments for such treatment or prevention. 

5 hi a fifth aspect, the invention includes a substantially pure zebrafish ?KCX 

polypeptide. This polypeptide can include or consist essentially of, for example, an 
amino acid sequence that is substantially identical to the amino acid sequence of SEQ ID 
NO:2. The invention also includes variants of these polypeptides that include sequences 
that are at least 75%, 85%, or 95% identical to the sequences of these polypeptides, and 

10 which have PKCX activity or otherwise are characteristic of the diseases and conditions 
mentioned elsewhere herein. Fragments of these polypeptides are also included in the 
invention. For example, fragments that include any of the different domains of PKCX, in 
varying combinations, are included. 

In a sixth aspect, the invention provides an isolated nucleic acid molecule (e.g., a 

15 DNA molecule) including a sequence encoding a zebrafish PKCX, polypeptide. This 
nucleic acid molecule can encode a polypeptide including or consisting essentially of an 
amino sequence that is substantially identical to the amino acid sequence of SEQ ID 
NO:2. The invention also includes nucleic acid molecules that hybridize to the 
complement of SEQ ID NO:l under highly stringent conditions and encode polypeptides 

20 that have VKCk activity or otherwise are characteristic of the diseases and conditions 
mentioned elsewhere herein. 

In a seventh aspect, the invention provides a vector including the nucleic acid 
molecule described above. 

In an eighth aspect, the invention includes a cell including the vector described 

25 above. 

In a ninth aspect, the invention provides a non-human transgenic animal (e.g., a 
zebrafish or a mouse) including the nucleic acid molecule described above. 

In a tenth aspect, the invention provides a non-human animal having a knockout 
mutation in one or both alleles encoding a ?KCX polypeptide. 
30 In an eleventh aspect, the invention includes a cell from the non-human knockout 

animal described above. 



In a twelfth aspect, the invention includes a non-human transgenic animal (e.g., a 
zebrafish) including a nucleic acid molecule encoding a mutant PKCX polypeptide, e.g., 
a polypeptide having the heart and soul mutation. 

In a thirteenm aspect, the invention provides an antibody that specifically binds 

5 to a PKCX polypeptide. 

By "polypeptide" or "polypeptide fragment" is meant a chain of two or more 
(e.g., 10, 15, 20, 30, 50, 100, or 200, or more) amino acids, regardless of any post- 
translational modification (e.g., glycosylafion or phosphorylation), constituting all or part 
of a naturally or non-naturally occurring polypeptide. By "post-translational 

10 modification" is meant any change to a polypeptide or polypeptide fragment during or 
after synthesis. Post-translational modifications can be produced naturally (such as 
during synthesis within a cell) or generated artificially (such as by recombinant or 
chemical means). A "protein" can be made up of one or more polypeptides. 

By "Protein Kinase C X protein," "Protein Kinase C X polypeptide," "PKCX 

15 protein," or "PKCX polypeptide" is meant a polypeptide that has at least 45%, preferably 
at least 60%, more preferably at least 75%, and most preferably at least 90% amino acid 
sequence identity to the sequence of a human (SEQ ID NO: 5) or a zebrafish (SEQ ID 
NO:2) PKCX polypeptide. Polypeptide products from splice variants of PKCX gene 
sequences and PKCX genes containing mutations are also included in this definition. A 

20 PKCX polypeptide as defined herein plays a role in heart development, modeling, and 
function. It can be used as a marker of diseases and conditions associated with PKCX, 
such as heart disease (also see below). 

By a "Protein Kinase C X nucleic acid molecule" or "PKCX nucleic acid 
molecule" is meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA 

25 (e.g., mRNA) molecule, that encodes a PKCX protein (e.g., a human (encoded by SEQ 
ID NO:4) or a zebrafish (encoded by SEQ ID NOs:l or 3) PKCX protein), a PKCX 
polypeptide, or a portion thereof, as defined above. A mutation in a PKCX nucleic acid 
molecule can be characterized, for example, by the insertion of a premature stop codon 
anywhere in the PKCX gene. For example, codon R515 can be changed to a stop codon 

30 (CGA to TGA), or codon W5 19 can be changed to a stop codon (TGG to TAG). In 
addition to this zebrafish Protein Kinase C X mutation (hereinafter referred to as "the 
heart and soul mutation"), the invention includes any mutation that results in aberrant 
PKCX protein production or function, including, only as examples, null mutations and 

4 
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additional mutations causing truncations. The truncations can be carboxyl terminal 
truncations in which the carboxyl terminal half of the protein (or a portion thereof) is not 
produced. For example, at least 10, 25, 50, 70, 75, 100, 150, 200, or 250 amino acids of 
the carboxyl terminal half of the protein can be absent. 

5 The term "identity" is used herein to describe the relationship of the sequence of 

a particular nucleic acid molecule or polypeptide to the sequence of a reference molecule 
of the same type. For example, if a polypeptide or a nucleic acid molecule has the same 
amino acid or nucleotide residue at a given position, compared to a reference molecule 
to which it is aligned, there is said to be "identity" at that position. The level of 

1 o sequence identity of a nucleic acid molecule or a polypeptide to a reference molecule is 
typically measured using sequence analysis software with the default parameters 
specified therein, such as the introduction of gaps to achieve an optimal alignment (e.g., 
Sequence Analysis Software Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705, 

15 BLAST, or PILEUP/PRETTYBOX programs). These software programs match 

identical or similar sequences by assigning degrees of identity to various substitutions, 
deletions, or other modifications. Conservative substitutions typically include 
substitutions within the following groups: glycine, alanine, valine, isoleucine, and 
leucine; aspartic acid, glutamic acid, asparagine, and glutamine; serine and threonine; 

20 lysine and arginine; and phenylalanine and tyrosine. 

A nucleic acid molecule or polypeptide is said to be "substantially identical" to a 
reference molecule if it exhibits, over its entire length, at least 51%, preferably at least 
55%, 60%, or 65%, and most preferably 75%, 85%, 90%, or 95% identity to the 
sequence of the reference molecule. For polypeptides, the length of comparison 

25 sequences is at least 1 6 amino acids, preferably at least 20 amino acids, more preferably 
at least 25 amino acids, and most preferably at least 35 amino acids. For nucleic acid 
molecules, the length of comparison sequences is at least 50 nucleotides, preferably at 
least 60 nucleotides, more preferably at least 75 nucleotides, and most preferably at least 
110 nucleotides. Of course, the length of comparison can be any length up to and 

30 including full length. 

A PKCX nucleic acid molecule or a PKCX polypeptide is "analyzed" or subject to 
"analysis" if a test procedure is carried out on it that allows the determination of its 
biological activity or whether it is wild type or mutated. For example, one can analyze 



5 



the PKCX genes of an animal (e.g., a human or a zebrafish) by amplifying genomic DNA 
of the animal using the polymerase chain reaction, and then determining whether the 
amplified DNA contains a mutation, for example, the heart and soul mutation, by, e.g., 
nucleotide sequence or restriction fragment analysis. 

5 By "probe" or "primer" is meant a single-stranded DNA or RNA molecule of 

defined sequence that can base pair to a second DNA or RNA molecule that contains a 
complementary sequence (a "target"). The stability of the resulting hybrid depends upon 
the ejctent of the base pairing that occurs. This stability is affected by parameters such as 
the degree of complementarity between the probe and target molecule, and the degree of 

10 stringency of the hybridization conditions. The degree of hybridization stringency is 
affected by parameters such as the temperature, salt concentration, and concentration of 
organic molecules, such as formamide, and is determined by methods that are well 
known to those skilled in the art. Probes or primers specific for PKCX nucleic acid 
molecules, preferably, have greater than 45% sequence identity, more preferably at least 

15 55-75% sequence identity, still more preferably at least 75-85% sequence identity, yet 
more preferably at least 85-99% sequence identity, and most preferably 100% sequence 
identity to the sequences of human (SEQ ID NO:4) or zebrafish (SEQ ID NOs:l and 3) 
?KCl genes. 

Probes can be detectably labeled, either radioactively or non-radioactively, by 
20 methods that are well known to those skilled in the art. Probes can be used for methods 
involving nucleic acid hybridization, such as nucleic acid sequencing, nucleic acid 
amplification by the polymerase chain reaction, single stranded conformational 
polymorphism (SSCP) analysis, restriction fragment polymorphism (RFLP) analysis, 
Southern hybridization, northern hybridization, in situ hybridization, electrophoretic 
25 mobility shift assay (EMSA), and other methods that are well known to those skilled in 
the art. 

A molecule, e.g., an oligonucleotide probe or primer, a gene or fragment thereof, 
a cDNA molecule, a polypeptide, or an antibody, can be said to be "detectably-labeled" 
if it is marked in such a way that its presence can be directly identified in a sample. 
30 Methods for detectably labeling molecules are well known in the art and include, without 
limitation, radioactive labeling (e.g., with an isotope, such as 32 P or 35 S) and 
nonradioactive labeling (e.g., with a fluorescent label, such as fluorescein). 

6 
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By a "substantially pure polypeptide" is meant a polypeptide (or a fragment 
thereof) that has been separated from proteins and organic molecules that naturally 
accompany it. Typically, a polypeptide is substantially pure when it is at least 60%, by 
weight, free from the proteins and naturally occurring organic molecules with which it is 

5 naturally associated. Preferably, the polypeptide is a PKCX, polypeptide that is at least 
75%, more preferably at least 90%, and most preferably at least 99%, by weight, pure. A 
substantially pure PKCX, polypeptide can be obtained, for example, by extraction from a 
natural source, by expression of a recombinant nucleic acid molecule encoding a PKCX 
polypeptide, or by chemical synthesis. Purity can be measured by any appropriate 

10 method, e.g., by column chromatography, polyacrylamide gel electrophoresis, or HPLC 
analysis. 

A polypeptide is substantially free of naturally associated components when it is 
separated from those proteins and organic molecules that accompany it in its natural 
state. Thus, a protein that is chemically synthesized or produced in a cellular system that 

15 is different from the cell in which it is naturally produced is substantially free from its 
naturally associated components. Accordingly, substantially pure polypeptides not only 
include those that are derived from eukaryotic organisms, but also those synthesized in 
E. coli, other prokaryotes, or in other such systems. 

By "isolated nucleic acid molecule" is meant a nucleic acid molecule that is 

20 removed from the environment in which it naturally occurs. For example, a naturally- 
occurring nucleic acid molecule present in the genome of cell or as part of a gene bank is 
not isolated, but the same molecule, separated from the remaining part of the genome, as 
a result of, e.g., a cloning event (amplification), is "isolated." Typically, an isolated 
nucleic acid molecule is free from nucleic acid regions (e.g., coding regions) with which 

25 it is immediately contiguous, at the 5' or 3' ends, in the naturally occurring genome. 

Such isolated nucleic acid molecules can be part of a vector or a composition and still be 
isolated, as such a vector or composition is not part of its natural environment. 

An antibody is said to "specifically bind" to a polypeptide if it recognizes and 
binds to the polypeptide (e.g., a FK.CX polypeptide), but does not substantially recognize 

30 and bind to other molecules (e.g., non-PKCX-related polypeptides) in a sample, e.g., a 
biological sample, which naturally includes the polypeptide. 
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By "high stringency conditions" is meant conditions that allow hybridization 
comparable with the hybridization that occurs using a DNA probe of at least 100, e.g., 
200, 350, or 500, nucleotides in length, in a buffer containing 0.5 M NaHPC-4, pH 7.2, 
7% SDS, 1 mM EDTA, and 1% BSA (fraction V), at a temperature of 65°C, or a buffer 
5 containing 48% formamide, 4.8 x SSC, 0.2 M Tris-Cl, pH 7.6, 1 x Denhardt's solution, 
10% dextran sulfate, and 0.1% SDS, at a temperature of 42°C. (These are typical 
conditions for high stringency northern or Southern hybridizations.) High stringency 
hybridization is also relied upon for the success of numerous techniques routinely 
perfonned by molecular biologists, such as high stringency PCR, DNA sequencing, 
10 single strand conformational polymorphism analysis, and in situ hybridization. In 

contrast to northern and Southern hybridizations, these techniques are usually performed 
with relatively short probes (e.g., usually 16 nucleotides or longer for PCR or 
sequencing, and 40 nucleotides or longer for in situ hybridization). The high stringency 
conditions used in these techniques are well known to those skilled in the art of 
15 molecular biology, and examples of them can be found, for example, in Ausubel et al., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1998, 
which is hereby incorporated by reference. 

By "sample" is meant a tissue biopsy, amniotic fluid, cell, blood, serum, urine, 
stool, or other specimen obtained from a patient or a test subject. The sample can be 
20 analyzed to detect a mutation in a PKCX, gene, or expression levels of a PKQ, gene, by 
methods that are known in the art. For example, methods such as sequencing, single- 
strand conformational polymorphism (SSCP) analysis, or restriction fragment length 
polymorphism (RFLP) analysis of PCR products derived from a patient sample can be 
used to detect a mutation in aPKQ, gene; ELISA and other immunoassays can be used 
25 to measure levels of a VKCX polypeptide; and PCR can be used to measure the level of a 
PKQ, nucleic acid molecule. 

By "Protein Kinase C ^-related disease," "PKCtarelated disease," "Protein 
Kinase C ^-related condition " or "PKC ^-related condition" is meant a disease or 
condition that results from inappropriately high or low expression of a PKCA, gene, or a 
30 mutation in a PKCk gene (including control sequences, such as promoters) that alters the 
biological activity of a PKCX nucleic acid molecule or polypeptide. PKCX-related 
diseases and conditions can arise in any tissue in which PKCA, is expressed during 

8 



BNSDOCID: <WQ 03023048A2_I_> 



WO 03/023048 PCT/US02/28410 

prenatal or post-natal life. PKCX-related diseases and conditions can include diseases or 
conditions of the heart or cancer (also see below). 

The invention provides several advantages. For example, using the diagnostic 
methods of the invention it is possible to detect an increased likelihood of diseases or 

5 conditions associated with PKCA, such as diseases of the heart or cancer, in a patient, so 
that appropriate intervention can be instituted before any symptoms occur. This may be 
useful, for example, with patients in high-risk groups for such diseases or conditions. 
Also, the diagnostic methods of the invention facilitate determination of the etiology of 
such an existing disease or condition in a patient, so that an appropriate approach to 

10 treatment can be selected. In addition, the screening methods of the invention can be. 
used to identify compounds that can be used to treat or to prevent these diseases or 
conditions. The invention can also be used to treat diseases or conditions (e.g., organ 
failure, such as heart or kidney failure) for which, prior to the invention, the only 
treatment was organ transplantation, which is limited by the availability of donor organs 

15 and the possibility of organ rej ection. 

Other features and advantages of the invention will be apparent from the 
following detailed description, the drawings, and the claims. 

Brief Description of the Drawings 
20 Fig. 1 A is a schematic representation of the structure of a small molecule, 

concentramide, that alters heart patterning. 

Fig. IB is a lateral view of the mushroom-shaped heart of a live, concentramide- 
treated embryo 30 hpf. The atrium is indicated with A, and the ventricle with V. 
Fig. 1C is a schematic representation of a timecourse of concentramide 
25 effectiveness. Black bars indicate the developmental time periods during which groups 
of embryos were immersed in water containing concentramide. An "x" indicates that 
treatment during the indicated time period alters the wild-type brain or heart phenotypes. 
An "o" indicates that the wild-type phenotype was observed. Blue and pink boxes mark 
the critical periods for development of the brain and heart phenotypes, respectively. 
30 Fig. 2 shows that hearts from lias mutant embryos phenocopy hearts from 

concentramide-treated embryos. In situ hybridization was performed with wild-type 
(Figs. 2A-2C), concentramide-treated (Figs. 2D-2F), and has (Figs. 2G-2I) embryos. 
The expression pattern of cardiac myosin light chain 2 (cmlc2) is shown for embryos 24 
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hpf (Figs. 2A, 2D, and 2G) and 30 hpf (Figs. 2B, 2E, and 2H). The relative locations of 
atrium (A) and ventricle (V) were confirmed by 7 um sagital sections of embryos in 
which the ventricle was prestained blue by in situ hybridization to ventricle-specific 
myosin heavy chain (vmhc), followed by staining of the atrium brown with the atrium- 

5 specific antibody S46 (Figs. 2C, 2F, and 21). The view is dorsal, anterior up in Figs. 2A, 
2D, and 2G. The view is lateral, anterior to the left in all other frames. 

Fig. 3A is a map of the has interval with genomic structure of the zebrafish 
PKCX gene. YAC and BAC clones are indicated by addresses beginning with "y" and 
"b." The BAC clone 23cl4 was sequenced to determine the entire genomic structure of 

10 the has gene. From the partial sequence of the BACs listed, a preliminary transcript map 
of the region was determined (see Table 1). The zebrafish PK.CA, gene comprises 18 
exons represented by vertical lines. The site of the mutations associated with the ml29 
and m.567 alleles is indicated with an asterisk. 

Fig. 3B is an anti-PKCA, western blot of protein extracts from wild-type embryos 

15 (WT), has mutant embryos (m567 -/-), and siblings of has mutant embryos (m567 +/+ 
and+/-). 

Figs. 3C-3E show that antisense disruption of PKCX expression phenocopies the 
has mutation. Wild-type embryos (3C), has embryos (3D), and wild-type embryos 
injected with a PKCA. antisense morpholino oligomer (3E) were photographed live 2 

20 days postfertilization. 

Fig. 4 shows that PKCA, is required for lamination, cell polarity, and epithelial 
cell-cell interaction in the retina. Transverse 5 um sections of wild-type (Figs. 4A-4B), 
concentramide-treated (Figs. 4C-4D), and has (Figs. 4E-4F) embryos were stained with 
hematoxylin-eosin 5 days postfertilization (Figs. 4A, 4C, and 4E) or with dapi 30 hpf 

25 (Figs. 4B, 4D, and 4F). Arrowheads indicate mitotic nuclei. Zonula occludens-1 
localization in the retina is shown by 5 u.m transverse sections following staining of 
wild-type (Fig. 4G) or has (Fig. 4H) embryos with an anti-ZO-1 antibody. 

Fig. 5 shows the effects of PKO, inactivation and concentramide treatment on 
polarity of the zebrafish kidney and the C. elegans embryo. An apical kidney marker 

30 (3G8) was used to stain kidneys of wild-type (Fig. 5A), concentramide-treated (Fig. 5B), 
and has (Fig. 5C) embryos. Transverse 2 urn sections of the pronephric duct are shown. 
Figs. 5D and 5E, C. elegans strain KK871, a stable expresser of a par2:GFP fusion 

10 
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protein, was treated with 34 uM concentramide and allowed to develop at room 
temperature. Nomarski (Fig. 5D) and fluorescence (Fig. 5E) microscopy were used to 
visualize the asymmetry of division and par2:GFP localization after the first cell 
division. Posterior is to the left. 

5 Fig. 6 shows alterations in anterior-posterior patterning after treatment with 

concentramide. Figs. 6A-6C, In situ hybridization was used to show Pax2.1 expression 
in untreated (Fig- 6A) and concentramide-treated (Fig. 6B) 18-somite embryos. The 
expression patterns have been false-colored blue for untreated embryos and red for 
concentramide-treated embryos. Fig. 6C shows an overlay of the images from Figs. 6 A 

10 and 6B. Arrowheads indicate areas of Pax2. 1 expression at the midbram-hindbrain 
boundary and in the otic placodes. The view is lateral, anterior to the left in Figs. 6A- 
6C. Fig. 6D, The distance between the anterior edge of the heart field, as defined by 
cmlc2 in situ staining, and the rostral extreme of the zebrafish embryo was measured in 
wild-type (WT), concentramide-treated (cone), and has embryos at the 18-somite stage. 

15 Error bars represent standard error. 

Fig. 7 shows the order of anterior and posterior heart field fusion. Dorsal views 
of cmlc2 expression atthel6-somite (Figs. 7A-7C) and 18-somite (Figs. 7D-7F) stages. 
Expression patterns for wild-type (Fig. 7A and Fig. 7D), concentramide-treated (Fig. 7B 
and Fig. 7E), and has (Fig. 7C and 7F) embryos are shown. Anterior is up. 

20 Fig. 8 is a schematic representation of a model for chamber patterning in the 

zebrafish heart. Normally, the bilateral primordia of the heart field converge and fuse 
first at the posterior end, followed by the anterior end to form a cone. The cone then 
rotates to orient atrial precursors toward the anterior and ventricular precursors toward 
the posterior in an extended heart tube. In concentramide-treated and has mutant 

25 embryos, the fusion order of the ends of the heart field is reversed, proceeding from the 
anterior to the posterior end. Rotation of the cone is blocked, preventing formation of 
the heart tube and causing the concentric heart chamber phenotype. Presumptive atrial 
precursor cells are colored red, ventricular precursor cells are colored blue. Views are 
dorsal; anterior is up. 

30 

Detailed Description 
The invention provides methods of diagnosing, preventing, and treating diseases 
and conditions associated with PKCk, such as diseases or conditions of the heart (also 
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see below), and screening methods for identifying compounds that can he used to treat or 
to prevent such diseases and conditions. In particular, we have identified a small 
molecule, concentramide, and a genetic mutation, heart-and-soul (has), which disrupt 
the earliest heart. Both cause the ventricle to form within the atrium. We show here that 

5 the has gene encodes an atypical Protein Kinase C, Protein Kinase C X (PKCA). The 
has mutation results in the disruption of epithelial cell-cell interactions in a broad range 
of tissues. Concentramide does not disrupt epithelial cell interactions but, rather, shifts 
the converging heart field of developing embryos rostrally. What is shared between the 
effects of concentramide and has is a reversal of the order of fusion of the anterior and 

10 posterior ends of the heart field. 

The diagnostic methods of the invention thus involve detection of mutations in 
genes encoding ¥KCk proteins, while the compound identification methods involve 
screening for compounds that affect the phenotype of organisms having mutations in 
genes encoding PKCX or other models of appropriate diseases and conditions. The 

15 compound identification methods can also involve screening of candidate compounds in 
the presence of concentramide, using organisms with or without a PKCX mutation (e.g., 
the has mutation). Compounds identified in this manner, as well as ?KCk genes and 
proteins themselves, can be used in methods to treat or prevent diseases and conditions 
associated with PKCX-. Compounds, antisense molecules, and antibodies that are found 

20 to inhibit PKCX, function can also be used to prevent or treat cancer. 

The invention also provides animal model systems (e.g., zebrafish having 
mutations (e.g., the heart and soul mutation) in PKCk genes, or mice (or other animals) 
having such mutations) that can be used in the screening methods mentioned above, as 
well as the ?KCk protein, and genes encoding this protein. Also included in the 

25 invention are genes encoding mutant zebrafish PKC^ proteins (e.g., genes having the 
heart and soul mutation) and proteins encoded by these genes. Antibodies that 
specifically bind to these proteins (wild type or mutant) are also included in the 
invention. 

The diagnostic, screening, and therapeutic methods of the invention, as well as 
30 the animal model systems, proteins, and genes of the invention, are described further, as 
follows, after a brief description of diseases and conditions associated with PKCA, which 
can be diagnosed, prevented, or treated according to the invention. 

12 
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PKCX-Associated Diseases or Conditions 

Abnormalities in PKCX genes or proteins can be associated with any of a wide 
variety of diseases or conditions, all of which, can thus be diagnosed, prevented, or 
treated using the methods of the invention. For example, as discussed above, the heart 

5 and soul mutation in zebrafish is characterized by abnormal heart growth and 

development. Thus, detection of abnormalities in PKCX genes or their expression can be 
used in methods to diagnose, or to monitor the treatment or development of, diseases or 
conditions of heart. In addition, compounds that axe identified in the screening methods 
described herein, as well as PKCX nucleic acid molecules, proteins, and antibodies 

10 themselves, can be used in methods to prevent or treat such diseases or conditions. 

Specific examples of diseases or conditions of the heart that can be diagnosed, 
prevented, or treated according to the invention include congenital defects that result in 
heart malformation. These include congenital defects, such as Ebstein anomaly, which 
results in abnormalities of the tricuspid valve, as well as isomerism defects, which are 

15 characterized by a wide variety of abnormalities in the asymmetrical arrangement of 
particular organs, such as the heart, organs of the digestive tract, and the spleen, that 
normally occurs during development. 

In right isomerism sequence, for example, which is also known as asplenia 
syndrome, Ivemark syndrome, and right atrial isomerism, the right side structures of the 

20 heart are duplicated on the left side of the heart, and the spleen is absent. This condition 
can lead to very complex and severe heart defects, such as atrioventricular septal defect 
(AVSD). In contrast, in left isomerism sequence, which is also known as polysplenia 
syndrome, the left side heart structures are duplicated and multiple small spleens may be 
present. This condition can lead to heart defects as well, such as heart block, which 

25 results in a slow heart beat, atrial septal defect, which is characterized by a hole between 
the top two heart chambers, and AVSD. With both types of isomerisms, twisting of the 
bowel or intestinal obstruction may result, due to the incorrect positioning of the 
intestines. Related defects may occur in other organs, such as the kidney. 

Other diseases and conditions related to PKCX that can be diagnosed, prevented, 

30 or treated according to the invention include those that are characterized by 

abnormalities in tight junctions. As is noted above, we have found that abnormalities in 
PKCX, (caused, e.g., by the has mutation) can lead to defects in epithelial cell-cell 
interactions. This is due to abnormalities in the formation of tight junctions, which play 
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critical roles in the sealing of spaces between the individual epithelial or endothelial cells 
that make up sheets of these cells that line the cavities of the body (e.g., the 
gastrointestinal tract, blood vessels, the respiratory tract, and the urinary tract), as well as 
enclose and protect certain organs (e.g., the brain). These sheets of cells function as 
5 selective permeability barriers, and alteration of the permeability of these barriers, due 
to, e.g., a PKCA, defect, can lead to any of a number of diseases or conditions that are 
well known in the art. For example, increased permeability of the lining of the 
gastrointestinal tract can lead to Crohn's disease, acute gastroenteritis, and diarrhea. 
Also, defects in tight junctions can interfere with the critical functions of the blood/brain 
10 barrier or the blood/retina barrier. As an additional example, vascular permeabiHty 
defects in diabetic patients can lead to conditions such as diabetic retinopathy. 
Additional diseases and conditions that can be diagnosed, prevented, or treated, 
according to the invention, include those that are associated with abnormalities in 
epithelial cell polarity, such as polycystic kidney disease (e.g., autosomal dominant 
15 polycystic kidney disease). Also, because we have found that abnormalities in PKCX 
lead to defects in cell growth control, a role for VKCX in cancer is indicated. 
Compounds that are found to modulate PKCX activity, thus, can be used in the 
prevention and treatment of cancer, such as, for example, carcinomas (e.g., renal cell 
carcinoma), which are cancers derived from epithelial cells. 

20 

Diagnostic Methods 

Nucleic acid molecules encoding PKCA, proteins, as well as polypeptides encoded 
by these nucleic acid molecules and antibodies specific for these polypeptides, can be 
used in methods to diagnose or to monitor diseases and conditions involving mutations 
25 in, or inappropriate expression of, genes encoding this protein. 

The diagnostic methods of the invention can be used, for example, with patients 
that have a disease or condition associated with VKCk, in an effort to determine its 
etiology and, thus, to facilitate selection of an appropriate course of treatment. The 
diagnostic methods can also be used with patients who have not yet developed, but who 
30 are at risk of developing, such a disease or condition, or with patients that are at an early 
stage of developing such a disease or condition. Also, the diagnostic methods of the 
invention can be used in prenatal genetic screening, for example, to identify parents who 
may be carriers of a recessive mutation in a gene encoding a VKCk protein. The 
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methods of the invention can be used to diagnose (or to treat) the disorders described 
herein in any mammal, for example, in humans, domestic pets, or livestock. 

Abnormalities in PKCX that can be detected using the diagnostic methods of the 
invention include those characterized by, for example, (i) a gene encoding a PKCX 

5 protein containing a mutation that results in the production of an abnormal PKCX 
protein, (ii) an abnormal PKCX polypeptide itself (e.g., a truncated protein), and (iii) a 
mutation in a PKCX gene that results in production of an abnormal amount of this 
protein. Detection of such abnormalities can be used to diagnose human diseases or 
conditions related to PKCX, , such as those affecting the heart. Exemplary of the 

10 mutations in PKCX genes is the heart and soul mutation, which is described further 
below. 

A mutation in a PKCX gene can be detected in any tissue of a subject, even one in 
which this protein is not expressed. Because of the possibly limited number of tissues in 
which these proteins maybe expressed, for limited time periods, and because of the 

15 possible undesirability of sampling such tissues (e.g., heart tissue) for assays, it may be 
preferable to detect mutant genes in other, more easily obtained sample types, such as in 
blood or amniotic fluid samples. 

Detection of a mutation in a gene encoding a PKCX protein can be carried out 
using any standard diagnostic technique. For example, a biological sample obtained 

20 from a patient can be analyzed for one or more mutations (e.g., a heart and soul 
mutation) in nucleic acid molecules encoding a PKCX protein using a mismatch 
detection approach. Generally, this approach involves polymerase chain reaction (PCR) 
amplification of nucleic acid molecules from a patient sample, followed by identification 
of a mutation (i.e., a mismatch) by detection of altered hybridization, aberrant 

25 electrophoretic gel migration, binding, or cleavage mediated by mismatch binding 

proteins, or by direct nucleic acid molecule sequencing. Any of these techniques can be 
used to facilitate detection of a mutant gene encoding a PKCX protein, and each is well 
known in the art. For instance, examples of these techniques are described by Orita et al. 
(Proc. Natl. Acad. Sci. U.S.A. 86:2766-2770, 1989) and Sheffield et al. (Proc. Natl. 

30 Acad. Sci. U.S.A. 86:232-236, 1989). 

As noted above, in addition to facilitating diagnosis of an existing disease or 
condition, mutation detection assays also provide an opportunity to diagnose a 
predisposition to disease related to a mutation in a PKCX gene before the onset of 
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symptoms. For example, a patient who is heterozygous for a gene encoding an abnormal 
PKCA. protein (or an abnormal amount thereof) that suppresses normal PKCX biological 
activity or expression may show no clinical symptoms of a disease related to such 
proteins, and yet possess a higher than normal probability of developing such disease. 
5 Given such a diagnosis, a patient can take precautions to minimize exposure to adverse 
environmental factors, and can carefully monitor their medical condition, for example, 
through frequent physical examinations. As mentioned above, this type of diagnostic 
approach can also be used to detect a mutation in a gene encoding the PKCX protein in 
prenatal screens. 

10 While it may be preferable to carry out diagnostic methods for detecting a 

mutation in a PKCX gene using genomic DNA from readily accessible tissues, as noted 
above, mPvNA encoding this protein, or the protein itself, can also be assayed from tissue 
samples in which it is expressed. Expression levels of a gene encoding PKCX, in such a 
tissue sample from a patient can be determined by using any of a number of standard 

15 techniques that are well known in the art, including northern blot analysis and 

quantitative PCR (see, e.g., Ausubel et al., supra; PCR Technology: Principles and 
Applications for DNA Amplification, H.A. Ehrlich, Ed., Stockton Press, NY; Yap et al. 
Nucl. Acids. Res. 19:4294, 1991). 

In another diagnostic approach of the invention, an immunoassay is used to 

20 detect or to monitor the level of a PKCX protein in a biological sample. Polyclonal or 
monoclonal antibodies specific for the PKCX protein can be used in any standard 
immunoassay format (e.g., ELISA, Western blot, or RIA; see, e.g., Ausubel et al., supra) 
to measure polypeptide the levels of PKCX. These levels can be compared to levels of 
PKCX in a sample from an unaffected individual. Detection of a decrease in production 

25 of PKCX, using this method, for example, may be indicative of a condition or a 

predisposition to a condition involving insufficient biological activity of the PKCX 
protein. 

Immunohistochemical techniques can also be utilized for detection of PKCX 
protein in patient samples. For example, a tissue sample can be obtained from a patient, 
30 sectioned, and stained for the presence of PKCX using an anti-PKCX antibody and any 
standard detection system (e.g., one that includes a secondary antibody conjugated to an 
enzyme, such as horseradish peroxidase). General guidance regarding such techniques 
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can be found in, e.g., Bancroft et al., Theory and Practice of Histological Techniques, 
Churchill Livingstone, 1982, and Ausubel et al., supra. 

Identification of Molecules that can be used to Treat or to Prevent Diseases or 
5 Conditions Associated with PKCA, 

Identification of a mutation in the gene encoding PKCX, as resulting in a 
phenotype that results in abnormal heart growth and development facilitates the 
identification of molecules (e.g., small organic or inorganic molecules, antibodies, 
peptides, or nucleic acid molecules) that can be used to treat or to prevent diseases or 

10 conditions associated with PKC?i,, as discussed above. The effects of candidate 

compounds on such diseases or conditions can be investigated using, for example, the 
zebrafish system. As is mentioned above, the zebrafish, Danio rerio, is a convenient 
organism to use in the genetic analysis of development. It has an accessible and 
transparent embryo, allowing direct observation of organ function from the earliest 

15 stages of development, has a short generation time, and is fecund. As discussed further 
below, zebrafish and other animals having a PKC?i, mutation, such as the heart and soul 
mutation, which can be used in these methods, are also included in the invention. 

In one example of the screening methods of the invention, a zebrafish having a 
mutation in a gene encoding the PKCX, protein (e.g., a zebrafish having the heart and 

20 soul mutation) is contacted with a candidate compound, and the effect of the compound 
on the development of a heart abnormality, or on the status of such an existing 
abnormality, is monitored relative to an untreated, identically mutant control. In a 
variation of this method, a zebrafish, with or without a mutation in the PKCX gene (e.g., 
the has mutation), is contacted with a candidate compound in the presence of 

25 concentramide. 

After a compound has been shown to have a desired effect in the zebrafish 
system, it can be tested in other models of heart disease, for example, in mice or other 
animals having a mutation in a gene encoding PKCL Alternatively, testing in such 
animal model systems can be carried out in the absence of zebrafish testing. Compounds 

30 of the invention can also be tested in animal models of cancer. 

Cell culture-based assays can also be used in the identification of molecules that 
increase or decrease PKCA, levels or biological activity. According to one approach, 
candidate molecules are added at varying concentrations to the culture medium of cells 
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expressing PKCX rnKNA. PKC^ biological activity is then measured using standard 
techniques. The measurement of biological activity can include the measurement of 
PKCX protein and nucleic acid molecule levels. 

In general, novel drugs for die prevention or treatment of diseases related to 

5 mutations in genes encoding PKCX can be identified from large libraries of natural 
products, synthetic (or semi-synthetic) extracts, and chemical libraries using methods 
that are well known in the art. Those skilled in the field of drug discovery and 
development will understand that the precise source of test extracts or compounds is not 
critical to the screening methods of the invention and that dereplication, or the 

10 elimination of replicates or repeats of materials already known for their therapeutic 
activities for PKCX,, can be employed whenever possible. 

Candidate compounds to be tested include purified (or substantially purified) 
molecules or one or more components of a mixture of compounds (e.g., an extract or 
supernatant obtained from cells; Ausubel et al., supra), and such compounds further 

15 include both naturally occurring or artificially derived chemicals and modifications of 
existing compounds. For example, candidate compounds can be polypeptides, 
synthesized organic or inorganic molecules, naturally occurring organic or inorganic 
molecules, nucleic acid molecules, and components thereof. 

Numerous sources of naturally occurring candidate compounds are readily 

20 available to those skilled in the art. For example, naturally occurring compounds can be 
found in cell (including plant, fungal, prokaryotic, and animal) extracts, mammalian 
serum, growth medium in which mammalian cells have been cultured, protein 
expression libraries, or fermentation broths. In addition, libraries of natural compounds 
in the form of bacterial, fungal, plant, and animal extracts are commercially available 

25 from a number of sources, including Biotics (Sussex, UK), Xenova (Slough, UK), 
Harbor Branch Oceanographic Institute (Ft. Pierce, FL), and PharmaMar, U.S. A. 
(Cambridge, MA). Furthermore, libraries of natural compounds can be produced, if 
desired, according to methods that are known in the art, e.g., by standard extraction and 
fractionation. 

30 Artificially derived candidate compounds are also readily available to those 

skilled in the art. Numerous methods are available for generating random or directed 
synthesis (e.g., semi-synthesis or total synthesis) of any number of chemical compounds, 
mcluding, for example, saccharide-, lipid-, peptide-, and nucleic acid molecule-based 
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compounds. la addition, synthetic compound libraries are commercially available from 
Brandon Associates (Merrimack, NH) and Aldrich Chemicals (Milwaukee, WI). 
Libraries of synthetic compounds can also be produced, if desired, according to methods 
known in the art, e.g., by standard extraction and fractionation. Furthermore, if desired, 

5 any library or compound can be readily modified using standard chemical, physical, or 
biochemical methods. The techniques of modern synthetic chemistry, including 
combinatorial chemistry, can also be used (reviewed in Schreiber, Bioorganic and 
Medicinal Chemistry 6:1172-1152, 1998; Schreiber, Science 287:1964-1969, 2000). 
When a crude extract is found to have an effect on the development or 

10 persistence of a PKCX-associated disease, further fractionation of the positive lead 
extract can be carried out to isolate chemical constituents responsible for the observed 
effect. Thus, the goal of the extraction, fractionation, and purification process is the 
careful characterization and identification of a chemical entity within the crude extract 
having a desired activity. The same assays described herein for the detection of 

15 activities in mixtures of compounds can be used to purify the active component and to 
test derivatives of these compounds. Methods of fractionation and purification of such 
heterogeneous extracts are well known in the art. If desired, compounds shown to be 
useful agents for treatment can be chemically modified according to methods known in 
the art. 

20 In general, compounds that are found to activate PKCX. expression or activity 

may be used in the prevention or treatment of diseases or conditions of heart, such as 
those that are characterized by abnormal growth or development, or heart failure (also 
see above). Compounds that are found to modulate, e.g., block PKCX expression or 
activity may be used to prevent or to treat cancer. 

25 

Animal Model Systems 

The invention also provides animal model systems for use in carrying out the 
screening methods described above. Examples of these model systems include zebrafish 
and other animals, such as mice, that have a mutation (e.g., the heart and soul mutation) 
30 in a YKCk gene. For example, a zebrafish model that can be used in the invention can 
include a mutation that results in a lack of PKC^ protein production or production of a 
truncated (e.g., by introduction of a stop codon) or otherwise altered PKCX gene product. 
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As a specific example, a zebrafish having the heart and soul mutation can be used (see 
below). 

Treatment or Prevention of PKCX -Associated Diseases or Conditions 

5 Compounds identified using the screening methods described above can be used 

to treat patients that have or are at risk of developing diseases or conditions of the heart 
or cancer. Nucleic acid molecules encoding the PKCX protein, as well as these proteins 
themselves, can also be used in such methods. Treatment may be required only for a 
short period of time or may, in some form, be required throughout a patient's Hfetime. 

10 Any continued need for treatment, however, can be determined using, for example, the 
diagnostic methods described above. In considering various therapies, it is to be 
understood that such therapies are, preferably, targeted to the affected or potentially 
affected organ (e.g., the heart). Such targeting can be achieved using standard methods. 
Treatment or prevention of diseases resulting from a mutated PKCX gene can be 

15 accomplished, for example, by modulating the function of a mutant PKCX protein. 

Treatment can also be accomplished by delivering normal PKCX protein to appropriate 
cells, altering the levels of normal or mutant PKCX protein, replacing a mutant gene 
encoding a PKCX protein with a normal gene encoding a PKCX protein, or administering 
a normal gene encoding a PKCX protein. It is also possible to correct the effects of a 

20 defect in a gene encoding a PKCX protein by modifying the physiological pathway (e.g., 
a signal transduction pathway) in which a PKCX protein participates. 

In a patient diagnosed as being heterozygous for a gene encoding a mutant PKCX 
protein, or as susceptible to such mutations or aberrant PKCX expression (even if those 
mutations or expression patterns do not yet result in alterations in expression or 

25 biological activity of PKCX), any of the therapies described herein can be administered 
before the occurrence of the disease phenotype. In particular, compounds shown to have 
an effect on the phenotype of mutants, or to modulate expression of PKCX proteins, can 
be administered to patients diagnosed with potential or actual disease by any standard 
dosage and route of administration. 

30 Any appropriate route of administration can be employed to administer a 

compound identified as described above, a PKCX gene, or a PKCX protein, according to 
the invention. For example, administration can be parenteral, intravenous, intra-arterial, 

20 
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subcutaneous, intramuscular, intraventricular, intracapsular, intraspinal, intracisternal, 
intraperitoneal, intranasal, by aerosol, by suppository, or oral. 

A therapeutic compound of the invention can be administered within a 
pharmaceutically acceptable diluent, carrier, or excipient, in unit dosage form. 

5 Administration can begin before or after the patient is symptomatic. Methods that are 
well known in the art for making formulations are found, for example, in Remington's 
Pharmaceutical Sciences (18 th edition), ed. A. Gennaro, 1990, Mack Publishing 
Company, Easton, PA. Therapeutic formulations can be in the form of liquid solutions 
or suspensions. Formulations for parenteral acrministration can contain, for example, 

10 excipients, sterile water, or saline; polyalkylene glycols, such as polyethylene glycol; oils 
of vegetable origin; or hydrogenated napthalenes. Biocompatible, biodegradable lactide 
polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers 
can be used to control the release of the compounds. Other potentially useful parenteral 
delivery systems include ethylene-vinyl acetate copolymer particles, osmotic pumps, 

15 implantable infusion systems, and liposomes. For oral administration, formulations can 
be in the form of tablets or capsules. Formulations for inhalation can contain excipients, 
for example, lactose, or can be aqueous solutions containing, for example, 
polyoxyethylene-9-lauryl ether, glycocholate, and deoxycholate, or can be oily solutions 
for administration in the form of nasal drops or as a gel. Alternatively, intranasal 

20 formulations can be in the form of powders or aerosols. 

To replace a mutant protein with normal protein, or to add protein to cells that do 
not express a sufficient amount of PKCX, or normal PKCX, it may be necessary to obtain 
large amounts of pure PKCX, protein from cell culture systems in which the protein is 
expressed (see, e.g., below). Delivery of the protein to the affected tissue can then be 

25 accomplished using appropriate packaging or adarinistration systems. 

Gene therapy is another therapeutic approach for preventing or ameliorating 
diseases caused by PKCX, gene defects. Nucleic acid molecules encoding wild type 
PKCX, protein can be delivered to cells that lack sufficient, normal PKCX protein 
biological activity (e.g., cells carrying mutations (e.g., the heart and soul mutation) in 

30 PKCX, genes). The nucleic acid molecules must be delivered to those cells in a form in 
which they can be taken up by the cells and so that sufficient levels of protein, to provide 
effective PKCX, protein function, can be produced. Alternatively, for some PKCX 
mutations, it may be possible to slow the progression of the resulting disease or to 
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modulate PKC1 protein activity by introducing another copy of a homologous gene 
bearing a second mutation in that gene, to alter the mutation, or to use another gene to 
block any negative effect. 

Transducing viral (e.g., retroviral, adenoviral, and adeno-associated viral) vectors 
5 can be used for somatic cell gene therapy, especially because of their high efficiency of 
infection and stable integration and expression (see, e.g., Cayouette et al., Human Gene 
Therapy 8:423-430, 1997; Kido et al., Current Eye Research 15:833-844, 1996; Bloomer 
et al., Journal of Virology 71:6641-6649, 1997; Naldini et al., Science 272:263-267, 
1996; andMiyoshi et al., Proc. Natl. Acad. Sci. U.S.A. 94:10319, 1997). For example, 
10 the full length PKCX gene, or a portion thereof, can be cloned into a retroviral vector and 
expression can be driven from its endogenous promoter, from the retroviral long 
terminal repeat, or from a promoter specific for a target cell type of interest. Other viral 
vectors that can be used include, for example, a vaccinia virus, a bovine papilloma virus, 
or a herpes vims, such as Epstein-Barr Virus (also see, for example, the vectors of 
15 Miller, Human Gene Therapy 15-14, 1990; Friedman, Science 244:1275-1281, 1989; 
Eglitis et al., BioTechniques 6:608-614, 1988; Tolstoshev et al., Current Opinion in 
Biotechnology 1:55-61, 1990; Sharp, The Lancet 337:1277-1278, 1991; Cornetta et al., 
Nucleic Acid Research and Molecular Biology 36:31 1-322, 1987; Anderson, Science 
'226:401-409, 1984; Moen, Blood Cells 17:407-416, 1991; Miller et al., Biotechnology 
20 7:980-990, 1989; Le Gal La Salle et al., Science 259:988-990, 1993; and Johnson, Chest 
107:77S-83S, 1995). Retroviral vectors are particularly well developed and have been 
used in clinical settings (Rosenberg et al., N. Engl. J. Med 323:370, 1990; Anderson et 
al.,U.S. Patent No. 5,399,346). 

Non-viral approaches can also be employed for the introduction of therapeutic 
25 DNA into cells predicted to be subject to diseases involving the PKCX, protein. For 
example, a ?KCk nucleic acid molecule or an antisense nucleic acid molecule can be 
introduced into a cell by lipofection (Feigner et al., Proc. Natl. Acad. Sci. U.S.A. 
84:7413, 1987; Ono et al., Neuroscience Letters 17:259, 1990; Brigham et al, Am. J. 
Med. Sci. 298:278, 1989; Staubinger et al., Methods in Enzymology 101:512, 1983), 
30 asialoorosomucoid-polylysine conjugation (Wu et al., Journal of Biological Chemistry 
263:14621, 1988; Wu et al, Journal of Biological Chemistry 264:16985, 1989), or by 
micro-injection under surgical conditions (Wolff et al., Science 247:1465, 1990). 
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Gene transfer can also be achieved using non-viral means involving transfection 
in vitro. Such methods include the use of calcium phosphate, DEAE dextran, 
electroporation, and protoplast fusion. Liposomes can also be potentially beneficial for 
delivery of DNA into a cell. Transplantation of normal genes into the affected tissues of 

5 a patient can also be accomplished by trarisferring a normal FKCk protein into a 

cultivatable cell type ex vivo (e.g., an autologous or heterologous primary cell or progeny 
thereof), after which the cell (or its descendants) are injected into a targeted tissue. 

YKCk cDNA expression for use in gene therapy methods can be directed from 
any suitable promoter (e.g., the human cytomegalovirus (CMV), simian virus 40 (SV40), 

10 or metallothionein promoters), and regulated by any appropriate mammalian regulatory 
element. For example, if desired, enhancers known to preferentially direct gene 
expression in specific cell types can be used to direct PICO, expression. The enhancers 
used can include, without limitation, those that are characterized as tissue- or cell- 
specific enhancers. Alternatively, if a PKCX genomic clone is used as a therapeutic 

15 construct (such clones can be identified by hybridization with YKCk cDNA, as described 
herein), regulation can be mediated by the cognate regulatory sequences or, if desired, by 
regulatory sequences derived from a heterologous source, including any of the promoters 
or regulatory elements described above. 

Molecules for effecting antisense-based strategies can be employed to explore 

20 PKCX protein gene function, as a basis for therapeutic drug design, as well as to treat 
PKCk-associated diseases, such as cancer. These strategies are based on the principle 
that sequence-specific suppression of gene expression (via transcription or translation) 
can be achieved by intracellular hybridization between genomic DNA or mRNA and a 
complementary antisense species. The formation of a hybrid RNA duplex interferes 

25 with transcription of the target PKCX-encoding genomic DNA molecule, or processing, 
transport, translation, or stability of the target ?KCX rnENA molecule. 

Antisense strategies can be delivered by a variety of approaches. For example, 
antisense oligonucleotides or antisense RNA can be directly acmiinistered (e.g., by 
intravenous injection) to a subject in a form that allows uptake into cells. Alternatively, 

30 viral or plasmid vectors that encode antisense RNA (or antisense RNA fragments) can be 
introduced into a cell in vivo or ex vivo. Antisense effects can be induced by control 
(sense) sequences; however, the extent of phenotypic changes is highly variable. 
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Phenotypic effects induced by antisense molecules are based on changes in criteria such 
as protein levels, protein activity measurement, and target mKNA levels. 

PKCX gene therapy can also be accomplished by direct administration of 
antisense PKCX mRNA to a cell that is expected to be adversely affected by the 
5 expression of wild type or mutant PKCX protein. The antisense PKCX mRNA can be 
produced and isolated by any standard technique, but is most readily produced by in viti-o 
transcription using an antisense PKCX cDNA under the control of a high efficiency 
promoter (e.g., the T7 promoter). Admimstration of antisense PKCX mRNA to cells can 
be carried out by any of the methods for direct nucleic acid molecule administration 
10 described above. 

An alternative strategy for inhibiting PKCX protein function using gene therapy 
involves intracellular expression of an anti-PKCX protein antibody or a portion of an 
anti-PKCX protein antibody. For example, the gene (or gene fragment) encoding a 
monoclonal antibody that specifically binds to a PKCX protein and inhibits its biological 
15 activity can be placed under the transcriptional control of a tissue-specific gene 
regulatory sequence. 

Another therapeutic approach included in the invention involves administration 
of a recombinant PKCX polypeptide, either directly to the site of a potential or actual 
disease-affected tissue (for example, by injection) or systemically (for example, by any 
20 conventional recombinant protein adrninistration technique) . The dosage of the PKCX 
protein depends on a number of factors, including the size and health of the individual 
patient but, generally, between 0.1 mg and 100 mg, inclusive, is administered per day to 
an adult in any pharmaceutically acceptable formulation. 

In addition to the therapeutic methods described herein, involving aclministration 
25 of PKCX-modulating compounds, PKCX proteins, or PKCX nucleic acids to patients, the 
invention provides methods of culturing organs in the presence of such molecules. In 
particular, as is noted above, a PKCX mutation is associated with abnormal heart growth 
and development Thus, culturing heart tissue in the presence of these molecules can be 
used to promote its growth and development. This tissue can be that which is being 
30 prepared for transplant from, e.g., an allogeneic or xenogeneic donor, as well as synthetic 
tissue or organs. 
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Synthesis of PKCX Proteins. Polypeptides, and Polypeptide Fragments 

Those skilled in the art of molecular biology will understand that a wide variety 
of expression systems can be used to produce recombinant PKCX. proteins. As discussed 
further below, the precise host cell used is not critical to the invention. The PKCX. 

5 proteins can be produced in a prokaryotic host (e.g., E. coli) or in a eukaryotic host (e.g., 
S. cerevisiae, insect cells, such as Sf9 cells, or mammalian cells, such as COS-1, NIH 
3T3, or HeLa cells). These cells are commercially available from, for example, the 
American Type Culture Collection, Manassas, VA (see also Ausubel et al., supra). The 
method of transformation and the choice of expression vehicle (e.g., expression vector) 

10 will depend on the host system selected. Transformation and transfection methods are 
described, e.g., in Ausubel et al., supra, and expression vehicles can be chosen from 
those provided, e.g., in Pouwels et al., Cloning Vectors: A Laboratory Manual, 1985, 
Supp. 1987. Specific examples of expression systems that can be used in the invention 
are described further as follows. 

15 For protein expression, eukaryotic or prokaryotic expression systems can be 

generated in which PKCX, gene sequences are introduced into a plasmid or other vector, 
which is then used to transform living cells. Constructs in which full-length FKCk 
cDNAs, containing the entire open reading frame, inserted in the correct orientation into 
an expression plasmid, can be used for protein expression. Alternatively, portions of 

20 PKCX gene sequences, including wild type or mutant PK.CK sequences, can be inserted. 
Prokaryotic and eukaryotic expression systems allow various important functional 
domains of PKC^ proteins to be recovered, if desired, as fusion proteins, and then used 
for bmding, structural, and functional studies, and also for the generation of antibodies. 
Typical expression vectors contain promoters that direct synthesis of large 

25 amounts of mRNA corresponding to a nucleic acid molecule that has been inserted into 
the vector. They can also include a eukaryotic or prokaryotic origin of replication, 
allowing for autonomous replication within a host cell, sequences that confer resistance 
to an otherwise toxic drug, thus allowing vector-containing cells to be selected in the 
presence of the drug, and sequences that increase the efficiency with which the 

30 synthesized mRNA is translated. Stable, long-term vectors can be maintained as freely 
replicating entities by using regulatory elements of, for example, viruses (e.g., the OriP 
sequences from the Epstein Barr Virus genome). Cell lines can also be produced that 
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have the vector integrated into genomic DNA of the cells and, in this manner, the gene 
product can be produced in the cells on a continuous basis. 

Expression of foreign molecules in bacteria, such as Escherichia coli, requires 
the insertion of a foreign nucleic acid molecule, e.g., a PKCX nucleic acid molecule, into 

5 a bacterial expression vector. Such plasmid vectors include several elements required 
for the propagation of the plasmid in bacteria, and for expression of foreign DNA 
contained within the plasmid. Propagation of only plasmid-bearing bacteria is achieved 
by introducing, into the plasmid, a selectable marker-encoding gene that allows plasmid- 
bearing bacteria to grow in the presence of an otherwise toxic drug. The plasmid also 

10 contains a transcriptional promoter capable of directing synthesis of large amounts of 
mRNA from the foreign DNA. Such promoters can be, but are not necessarily, inducible 
promoters that initiate transcription upon induction by culture under appropriate 
conditions (e.g., in the presence of a drug that activates the promoter). The plasmid also, 
preferably, contains a polylinker to simplify insertion of the gene in the correct 

15 orientation vrithin the vector. 

Once an appropriate expression vector containing a PKCX gene, or a fragment, 
fusion, or mutant thereof, is constructed, it can be introduced into an appropriate host 
cell using a transformation technique, such as, for example, calcium phosphate 
transfection, DEAE-dextran transfection, electroporation, microinjection, protoplast 

20 fusion, or liposome-mediated transfection. Host cells that can be transfected with the 
vectors of the invention can include, but are not limited to, E, coli or other bacteria, 
yeast, fungi, insect cells (using, for example, baculoviral vectors for expression), or cells 
derived from mice, humans, or other animals. Mammalian cells can also be used to 
express PKCX proteins using a virus expression system (e.g., a vaccinia virus expression 

25 system) described, for example, in Ausubel et al., supra. 

In vitro expression of PKCX proteins, fusions, polypeptide fragments, or mutants 
encoded by cloned DNA can also be carried out using the T7 late-promoter expression 
system. This system depends on the regulated expression of T7 RNA polymerase, an 
enzyme encoded in the DNA of bacteriophage T7. The T7 RNA polymerase initiates 

30 transcription at a specific 23 base pair promoter sequence called the T7 late promoter. 
Copies of the T7 late promoter are located at several sites on the T7 genome, but none 
are present in E. coli chromosomal DNA. As a result, in T7-infected E. coli, T7 RNA 
polymerase catalyzes transcription of viral genes, but not E. coli genes. In this 
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expression system, recombinant £. coli cells are first engineered to carry the gene 
encoding T7 RNA polymerase next to the lac promoter. In the presence of IPTG, these 
cells transcribe the T7 polymerase gene at a high rate and synthesize abundant amounts 
of T7 RNA polymerase. These cells are then transformed with plasmid vectors that • 
5 carry a copy of the T7 late promoter protein. When IPTG is added to the culture medium 
containing these transformed E. coli cells, large amounts of T7 RNA polymerase are 
produced. The polymerase then binds to the T7 late promoter on the plasmid expression 
vectors, catalyzing transcription of the inserted cDNA at a high rate. Since each E. coli 
cell contains many copies of the expression vector, large amounts of mRNA 
10 corresponding to the cloned cDNA can be produced in this system and the resulting 
protein can be radioactively labeled. 

Plasmid vectors containing late promoters and the corresponding RNA 
polymerases from related bacteriophages, such as T3, T5, and SP6, can also be used for 
in vitro production of proteins from cloned DNA. E. coli can also be used for expression 
15 using an Ml 3 phage, such as mGPI-2. Furthermore, vectors that contain phage lambda 
regulatory sequences, or vectors that direct the expression of fusion proteins, for 
example, a maltose-binding protein fusion protein or a glutathione-S-transferase fusion 
protein, also can be used for expression in E. coli. 

Eukaryotic expression systems are useful for obtaining appropriate post- 
20 translational modification of expressed proteins. Transient transfection of a eukaryotic 
expression plasmid containing a PKCX gene into a eukaryotic host cell allows the 
transient production of a PKCX, protein by the transfected host cell. PKC^, proteins can 
also be produced by a stably-transfected eukaryotic (e.g., mammalian) cell line. A 
number of vectors suitable for stable transfection of mammalian cells are available to the 
25 public (see, e.g., Pouwels et al., supra), as are methods for constructing lines including 
such cells (see, e.g., Ausubel et al., supra). 

In one example, cDNA encoding a PKC^. protein, fusion, mutant, or polypeptide 
fragment is cloned into an expression vector that includes the dihydrofolate reductase 
(DHFR) gene. Integration of the plasmid and, therefore, integration of the heart and 
30 soul protein-encoding gene, into the host cell chromosome is selected for by inclusion of 
0.01-300 uM methotrexate in the cell culture medium (Ausubel et al., supra). This 
dominant selection can be accomplished in most cell types. Recombinant protein 
expression can be increased by DHFR-mediated amplification of the transfected gene. 
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Methods for selecting cell lines bearing gene amplifications are described in Ausubel et 
al., supra. These methods generally involve extended culture in medium containing 
gradually increasing levels of methotrexate. The most commonly used DHFR- 
containing expression vectors are pCVSEH-DHFR andpAdD26SV(A) (described, for 

5 example, in Ausubel et al., supra). The host cells described above or, preferably, a 
DHFR-deficient CHO cell line (e.g., CHO DHFR- cells, ATCC Accession No. CRL 
9096) are among those that are most preferred for DHFR selection of a stably transfected 
cell line or DHFR-mediated gene amplification. 

Another preferred eulcaryotic expression system is the baculovirus system using, 

10 for example, the vector pBacPAK9, which is available from Clontech (Palo Alto, CA). 
If desired, this system can be used in conjunction with other protein expression 
techniques, for example, the myc tag approach described by Evan et al. (Molecular and 
Cellular Biology 5:3610-3616, 1985). 

Once a recombinant protein is expressed, it can be isolated from the expressing 

l 

15 cells by cell lysis followed by protein purification techniques, such as affinity 

chromatography. In this example, an anti-PKCX antibody, which can be produced by the 
methods described herein, can be attached to a column and used to isolate the 
recombinant PKCX Lysis and fractionation of PKCX-harboring cells prior to affinity 
chromatography can be performed by standard methods (see, e.g., Ausubel et al., supra). 

20 Once isolated, the recombinant protein can, if desired, be purified further by, e.g., high 
performance liquid chromatography (HPLC; e.g., see Fisher, Laboratory Techniques In 
Biochemistry and Molecular Biology, Work and Burdon, Eds., Elsevier, 1980). 

Polypeptides of the invention, particularly short PKC1 fragments and longer 
fragments of the N-terminus and C-terminus of PKCX, can also be produced by chemical 

25 synthesis (e.g., by the methods described in Solid Phase Peptide Synthesis, 2 nd ed., 1984, 
The Pierce Chemical Co., Rockford, IL). These general techniques of polypeptide 
expression and purification can also be used to produce and isolate useful PKCX 
fragments or analogs, as described herein. 

30 PKCA, Protein Fragments 

Polypeptide fragments that include various portions of PKCX proteins are useful 
in identifying the domains of PKCXthat are important for its biological activities. 
Methods for generating such fragments are well known in the art (see, for example, 
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Ausubel et al., supra), using the nucleotide sequences provided herein. For example, a 
PKCX protein fragment can be generated by PCR amplifying a desired PKCX nucleic 
acid molecule fragment using oligonucleotide primers designed based upon PKCX 
nucleic acid sequences. Preferably, the oligonucleotide primers include unique 

5 restriction enzyme sites that facilitate insertion of the amplified fragment into the cloning 
site of an expression vector (e.g., a mammalian expression vector, see above). This 
vector can then be introduced into a cell (e.g., a mammalian cell; see above) by artifice, 
using any of the various techniques that are known in the art, such as those described 
herein, resulting in the production of a PKCX protein fragment in the cell containing the 

10 expression vector. PKCX protein fragments (e.g., chimeric fusion proteins) can also be 
used to raise antibodies specific for various regions of the PKCX protein using, for 
example, the methods described below. 

PKCX Protein Antibodies 

15 To prepare polyclonal antibodies, PKCX proteins, fragments of PKCX proteins, or 

fusion proteins containing defined portions of PKCX proteins can be synthesized in, e.g., 
bacteria by expression of corresponding DNA sequences contained in a suitable cloning 
vehicle. Fusion proteins are commonly used as a source of antigen for producing 
antibodies. Two widely used expression systems for E. coli are lacZ fusions using the 

20 pUR series of vectors and trpE fusions using the pATH vectors. The proteins can be 
purified, coupled to a carrier protein, mixed with Freund's adjuvant to enhance 
stimulation of the antigenic response in an inoculated animal, and injected into rabbits or 
other laboratory animals. Alternatively, protein can be isolated from PKCX-expressing 
cultured cells. Following booster injections at bi-weeldy intervals, the rabbits or other 

25 laboratory animals are then bled and the sera isolated. The sera can be used directly or 
can be purified prior to use by various methods, including affinity chromatography 
employing reagents such as Protein A-Sepharose, antigen-Sepharose, and anti-mouse-Ig- 
Sepharose. The sera can then be used to probe protein extracts from PKCX-expressing 
tissue fractionated by polyacrylamide gel electrophoresis to identify PKCX proteins. 

30 Alternatively, synthetic peptides can be made that correspond to antigenic portions of the 
protein and used to inoculate the animals. 
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To generate peptide or full-length protein for use in making, for example, PKCX- 
specific antibodies, a PKCX coding sequence can be expressed as a C-terrninal or N- 
terrninal fusion with glutathione S-transferase (GST; Smith et al., Gene 67:31-40, 1988). 
The fusion protein can be purified on glutathione-Sepharose beads, eluted with 

5 glutathione, cleaved with a protease, such as thrombin or Factor-Xa (at the engineered 
cleavage site), and purified to the degree required to successfully immunize rabbits. 
Primary immunizations can be carried out with Freund's complete adjuvant and 
subsequent immunizations performed with Freund's incomplete adjuvant. Antibody 
titers can be monitored by Western blot and immunoprecipitation analyses using the 

10 protease-cleaved PKCX fragment of the GST-PKCX protein. Immune sera can be affinity 
purified using CNBr-Sepharose-coupled PKCX. Antiserum specificity can be 
determined using a panel of unrelated GST fusion proteins. 

Alternatively, monoclonal PKCX antibodies can be produced by using, as an 
antigen, PKCX, isolated from PKCX-expressing cultured cells or PKCX protein isolated 

15 from tissues. The cell extracts, or recombinant protein extracts containing PKCX, can, 
for example, be injected with Freund's adjuvant into mice. Several days after being 
injected, the mouse spleens can be removed, the tissues disaggregated, and the spleen 
cells suspended in phosphate buffered saline (PBS). The spleen cells serve as a source 
of lymphocytes, some of which would be producing antibody of the appropriate 

20 specificity. These can then be fused with permanently growing myeloma partner cells, 
and the products of the fusion plated into a number of tissue culture wells in the presence 
of selective agents, such as hypoxanthine, arninopterine, and thymidine (HAT). The 
wells can then be screened by ELISA to identify those containing cells making 
antibodies capable of binding to PKCX, polypeptide fragment, or mutant thereof. These 

25 cells can then be re-plated and, after a period of growth, the wells containing these cells 
can be screened again to identify antibody-producing cells. Several cloning procedures 
can be carried out until over 90% of the wells contain single clones that are positive for 
specific antibody production. From this procedure, a stable line of clones that produce 
the antibody can be established. The monoclonal antibody can then be purified by 

30 affmity chromatography using Protein A Sepharose and ion exchange chromatography, 
as well as variations and combinations of these techniques. Once produced, monoclonal 
antibodies are also tested for specific PKCX recognition by Western blot or 
immunoprecipitation analysis (see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et 
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al, European Journal of Immunology 6:511, 1976; Kohler et al., European Journal of 
Immunology 6:292, 1976; Hammerling et al., In Monoclonal Antibodies and T Cell 
Hybridomas, Elsevier, New York, NY, 1981; Ausubel et al., supra). 

As an alternate or adjunct immunogen to GST fusion proteins, peptides 

5 corresponding to relatively unique hydrophilic regions of PKCA, can be generated and 
coupled to keyhole limpet hemocyanin (KLH) through an introduced C-terminal lysine. 
Antiserum to each of these peptides can be similarly affinity-purified on peptides 
conjugated to BSA, and specificity tested by ELISA and Western blotting using peptide 
conjugates, and by Western blotting and immunoprecipitation using PKQ, for example, 

10 expressed as a GST fusion protein. 

Antibodies of the invention can be produced using PKCX amino acid sequences 
that do not reside within highly conserved regions, and that appear likely to be antigenic, 
as analyzed by criteria such as those provided by the Peptide Structure Program 
(Genetics Computer Group Sequence Analysis Package, Program Manual for the GCG 

15 Package, Version 7, 1991) using the algorithm of Jameson et al., CABIOS 4:181, 1988. 
These fragments can be generated by standard techniques, e.g., by PCR, and cloned into 
the pGEX expression vector. GST fusion proteins can be expressed in E. coli and 
purified using a glutathione-agarose affinity matrix (Ausubel et al., supra). To generate 
rabbit polyclonal antibodies, and to minimize the potential for obtairiing antisera that is 

20 non-specific, or exhibits low-affinity binding to YKCk, two or three fusions are 

generated for each protein, and each fusion is injected into at least two rabbits. Antisera 
are raised by injections in series, preferably including at least three booster injections. 

In addition to intact monoclonal and polyclonal anti-PKCA. antibodies, the 
invention features various genetically engineered antibodies, humanized antibodies, and 

25 antibody fragments, mcluding F(ab')2, Fab', Fab, Fv, and sFv fragments. Truncated 
versions of monoclonal antibodies, for example, can be produced by recombinant 
methods in which plasmids are generated that express the desired monoclonal antibody 
fragments) in a suitable host. Antibodies can be humanized by methods known in the 
art, e.g., monoclonal antibodies with a desired binding specificity can be commercially 

30 humanized (Scotgene, Scotland; Oxford Molecular, Palo Alto, CA). Fully human 
antibodies, such as those expressed in transgenic animals, are also included in the 
invention (Green et al., Nature Genetics 7:13-21, 1994). 
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Ladner (U.S. Patent Nos. 4,946,778 and 4,704,692) describes methods for 
preparing single polypeptide chain antibodies. Ward et al, Nature 341 :544-546, 1989, 
describes the preparation of heavy chain variable domains, which they term "single 
domain antibodies," and which have high antigen-binding affinities. McCafferty et al., 

5 Nature 348:552-554, 1990, shows that complete antibody V domains can be displayed 
on the surface of fd bacteriophage, that the phage bind specifically to antigen, and that 
rare phage (one in a million) can be isolated after affinity chromatography. Boss et al., 
U.S. Patent No. 4,816,397, describes various methods for producing immunoglobulins, 
and immunologically functional fragments thereof, that include at least the variable 

10 domains of the heavy and fight chains in a single host cell. Cabilly et al., U.S. Patent 
No. 4,816,567, describes methods for preparing chimeric antibodies. 

Use of PKCA. Antibodies 

Antibodies to PKCX can be used, as noted above, to detect PKCX or to inhibit the 

15 biological activities of PKCl. For example, a nucleic acid molecule encoding an 
antibody or portion of an antibody can be expressed within a cell to inhibit PKCX 
function. In addition, the antibodies can be coupled to compounds, such as 
radionuclides and liposomes, for diagnostic or therapeutic uses. Antibodies that inhibit 
the activity of aPKCl polypeptide described herein can also be useful in preventing or 

20 slowing the development of a disease caused by inappropriate expression of a wild type 
or mutant PKC1 gene. 

Detection of PKC1 Gene Expression 

As noted, the antibodies described above can.be used to monitor PKC1 gene 

25 expression. In situ hybridization of RNA can be used to detect the expression of PKCX, 
genes. RNA in situ hybridization techniques rely upon the hybridization of a specifically 
labeled nucleic acid probe to the cellular RNA in individual cells or tissues. Therefore, 
RNA in situ hybridization is a powerful approach for studying tissue- and temporal- 
specific gene expression. In this method, oligonucleotides, cloned DNA fragments, or 

30 antisense RNA transcripts of cloned DNA fragments corresponding to unique portions of 
PKCX. genes are used to detect specific mRNA species, e.g., in the tissues of animals, 
such as mice, at various developmental stages. Other gene expression detection 
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techniques axe known to those of skill in the art and can be employed for detection of 
PKCX gene expression. 

Identification of Additional PKO, Genes 

5 Standard techniques, such as the polymerase chain reaction (PCR) and DNA 

hybridization, can be used to clone PKO, gene homologues in other species and PKCX- 
related genes in humans. PKC^-related genes and homologues can be readily identified 
using low-stringency DNA hybridization or low-stringency PCR with human PKCA, 
probes or primers. Degenerate primers encoding human PKQ- or human PKC^-related 

10 amino acid sequences can be used to clone additional PKCA,-related genes and 
homologues by RT-PCR. 

Construction of Transgenic Animals and Knockout Animals 

Characterization of PKCA, genes provides information that allows PKCX 
15 knockout animal models to be developed by homologous recombination. Preferably, a 

PKCX knockout animal is a mammal, most preferably a mouse. Similarly, animal 

models of PKCX. overproduction can be generated by integrating one or more PKCA, 

sequences into the genome of an animal, according to standard transgenic techniques. 

Moreover, the effect of PKCX. mutations (e.g., dominant gene mutations) can be studied 
20 using transgenic mice carrying mutated PKCX transgenes or by introducing such 

mutations into the endogenous PKCX gene, using standard homologous recombination 

techniques. 

A replacement-type targeting vector, which can be used to create a knockout 
model, can be constructed using an isogenic genomic clone, for example, from a mouse 

25 strain such as 129/Sv (Stratagene Inc., LaJolla, CA). The targeting vector can be 

introduced into a suitably derived line of embryonic stem (ES) cells by electroporation to 
generate ES cell lines that carry a profoundly truncated form of a PKCX, gene. To 
generate chimeric founder mice, the targeted cell lines are injected into a mouse blastula- 
stage embryo. Heterozygous offspring can be interbred to homozygosity. PKC^, 

30 knockout mice provide a tool for studying the role of PKCX in embryonic development 
and in disease. Moreover, such mice provide the means, in vivo, for testing therapeutic 
compounds for amelioration of diseases or conditions involving PKCX-dependent or a 
PKCX-effected pathway. 
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Use of PKCX as a Marker for Stem Cells of the Heart 

As PKCk is expressed in cells that give rise to the heart during the course of 
development, it can be used as a marker for stem cells of the heart. For example, PKCX 

5 can be used to identify, sort, or target such stem cells. A pool of candidate cells, for 
example, can be analyzed for PKCX expression, to facilitate the identification of heart 
stem cells, which, based on this identification can be separated from the pool. The 
isolated stem cells can be used for many purposes that are known to those of skill in this 
art. For example, the stem cells can be used in the production of new organs, in organ 

1 o culture, or to fortify damaged or transplanted organs. 

Experimental Results 

Concentramide specifically modulates a biological pathway involved in heart patterning 
Zebrafish embryos have recently been shown to be amenable to high-throughput 

15 screening to identify small molecules that perturb developmental processes (Peterson et 
al., Proc. Natl. Acad. Sci. U.S.A. 97:12965-12969, 2000). In one such screen, we 
exposed developing zebrafish embryos to small molecules from a large, diverse chemical 
library. Visual inspection of the transparent embryos was used to identify small 
molecules that affect the global patterning of the heart. One of these small molecules is 

20 a biaryl compound containing an acrylamide moiety that we call concentramide (Fig. 
1A), originally identified as library number 32P6 (Peterson et al., supra). 

Normally, by 24 hours post-fertilization (hpf) the heart tube assembles in the 
midline, with the atrium anterior to the ventricle and slightly displaced towards the left 
(Fig. 2A), and blood flow is driven from atrial to ventricular end, first by persistalsis and 

25 then by sequential chamber contractions. By 30 hpf, the chambers are clearly 

demarcated (Fig. 2B, using cardiac myosin light chain 2, cmlc2, to label both chambers) 
and express different genes, as shown in Fig. 2C (ventricle-specific myosin heavy chain 
and atrial-specific antibody S46). 

Embryos exposed to concentramide develop compact hearts that do not sustain a 

30 circulation. It appears that both the atrium and ventricle form and beat in a coordinated 
manner in these fish, but that the ventricle forms in the center of the atrium, as shown in 
Figs. 2E and 2F. The result is a heart in which the atrium and ventricle form two 
concentric rings, the inner ring composed of the ventricle and the outer ring composed of 
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the atrium. From the dorsal view, the heart looks like a bullseye (Fig. 2D), and from the 
lateral view, it looks like an inverted mushroom, in which the ventricle forms the stalk of 
the mushroom and the atrium surrounds and covers the ventricle like a mushroom cap 
(Fig. IB). 

5 Several observations suggest that concentramide is a highly specific modulator of 

a particular molecular pathway critical to heart patterning. Concentramide is very 
potent, with an ED 50 of about 2 nM. More importantly, higher doses of concentramide 
do not appear to cause additional side effects. Concentramide causes virtually the same 
phenotype when used at a concentration of 6 uM as it does when used at a concentration 

10 of 6 nM, suggesting that it modulates a specific molecular target at least 1 ,000 times 

more potently than it modulates other proteins affecting visible developmental processes. 
The effect of concentramide on cardiovascular development does not appear to be a 
result of general cytotoxicity. Development of concentramide-treated embryos is not 
delayed relative to untreated siblings, and no increase in cell death is apparent. 

15 Concentramide also has no effect on the rate of proliferation of yeast or 

bromodeoxyuridine incorporation in mammalian cells. Given the potency of 
concentramide, its phenotypic reproducibility over a broad concentration range, and the 
rarity of the phenotype it produces (none of the >2000 other small molecules screened 
generates a similar phenotype), we conclude that concentramide is a specific modulator 

20 of a biological pathway responsible for heart patterning. 

A time -window for concentramide effects 

One advantage of small molecules over genetic mutations in studying a 
developmental process is that small molecules allow the process to be modulated with 

25 much greater temporal control. Small molecules can be added or washed away at any 
time during development, whereas genetic mutations are generally present throughout 
development. This temporal control afforded by small molecules facilitates the 
identification of critical periods for developmental processes. 

To identify the developmental stage at which concentramide disrupts heart 

30 patterning decisions, we added concentramide to the water of developing embryos at 
various times. As shown in Fig. 1C, embryos treated at any time prior to 14 hpf exhibit 
the concentric chamber morphology at 24 hpf while embryos treated after 17 hpf exhibit 
wild-type heart morphology at 24 hpf. Repeating the experiment with more precise 
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staging revealed that concentramide must be present before the 14-somite stage 
(approximately 15 hpf) to induce the concentric chamber morphology. Therefore, a 
developmental event occurring at the 14-somite stage is critical for heart patterning and 
is disrupted by the small molecule concentramide. 

5 

Tlie hearts of concentramide-treated embryos phenocopy heart-and-soul mutants 

Heart-and-soul (has) is a mutation isolated in our large-scale genetic screen. The 
hearts of homozygous has mutant embryos are small. We find here that, like those of 
concentramide-treated embryos, the hearts of has mutant embryos have ventricular tissue 

10 within the atrium (Figs. 2G and 2H). They manifest radial sequential contractions of the 
atrium, then the ventricle. The has mutant embryos, however, also manifest defects in 
many tissues including the retina, kidney, gut, and brain. These defects are not present 
in concentramide-treated embryos. The brains of concentramide-treated embryos 
develop abnormally, but treating embryos between 9 and 14 hpf eliminates this brain 

15 defect, while preserving the concentric heart chamber phenotype (Fig. 1C). Therefore, 
the heart phenotypes of concentramide-treated and has mutant embryos are very similar, 
but concentramide-treated embryos appear to have fewer developmental defects 
elsewhere, and the cardiac specificity of the phenotype can be increased further by 
controlling the timing of concentramide treatment. 

20 

Heart-and-soul encodes an atypical PKCX 

Given the phenotypic similarities between hearts from has and concentramide- 
treated embryos, we reasoned that cloning the has gene might provide molecular insight 
about the process of heart patterning. Furthermore, cloning of has might allow us to 

25 determine whether has and concentramide influence heart patterning through similar or 
distinct mechanisms. We mapped has by linkage analysis with zebrafish SSR markers 
(Michehnore et al., Proc. Natl. Acad. Sci. U.S.A. 88:9828-9832, 1991; Knapik et al., 
Nat. Genet. 18:338-343, 1998; Shimoda et al., Genomics 58:219-232, 1999) and AFLP 
(Vos et al., Nucleic Acids Res. 23:4407-4414, 1995) to an interval flanked by markers 

30 z8451 and zl 1023 of approximately 1 .IcM (Fig. 3A). These were used to initiate a walk 
using YACs and BACs, which proceeded by end-cloning, refined mapping, and 
ultimately sequencing. Genes identified as candidates for the mutation were assayed by 
in situ analysis and for cDNA polymorphism by RT-PCR of wild-type and mutant RNA 
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pools. The genes contained within the BACs are shown in Table 1 . The gene 
assignments are based on BLASTX alignments. 



Table 1. Candidate genes identified within the heart-and-soul interval 

5 



BAC address 


identified genes (GenBank accession#) 


109fl0/122nl7 


KIAA0670 protein/acinus (NP_055792) 




membane-type 1 metalloproteinase precursor (AAD13803) 




adaptin, gamma (NP_001 1 19) 




KIAA1416 protein, novel Helicase C-terminal domain and SNF2 N-termina 




domains containing protein, similar to KIAA0308 (CAB57836) 




ZPC domain containing protein 2 (AAD38907) 




zinc finger protein sal (AAB51127) 




cerebellin 1 precursor (NP_004343) 




RING finger protein (AAB05873) 


152p21 


unknown (NP_056541) 


89il5 


precerebeUin-like protein (AAF04305) 


23cl4 


PKCX 




transforming protein sno-N - chicken (151298) 


53cl7 


no genes detected by BLASTX (mostly repetitive) 



By sequencing PKCX from wild type and mutant embryos, we confirmed that 
both has alleles harbor mutations in the PKCX coding sequence. The mutation in the 
m567 allele causes a premature stop codon after amino acid 518, and the mutation in the 

10 ml29 allele causes a premature stop codon after amino acid 514 (Fig. 3A). We 
determined the complete genomic structure of the zebrafish PKCX. gene by shotgun 
sequencing of BAC 23c 14. It is comprised of 18 exons spanning approximately 45 kb. 
We find FKCX mRNA to be expressed in a broad range of tissues. 

The C-terminal truncation of PKQ, does not appear to destabilize the protein, as 

15 truncated protein is detected by western blot analysis of mutant embryos (Fig. 3B). 
However, truncation might be predicted to eliminate a domain essential for PKCX 
function, given that C-terminal truncation of PKCa or PKCP renders these related 
kinases catalytically inactive (Riedel et al., J. Cell. Biochem. 52:320-329, 1993; Riedel 
et al., Mol. Cell. Biol. 13:4728-4735, 1993). In order to confirm the role of PKCX 

20 mutation in the phenotype, we injected antisense morpholino oligomers complementary 
to the PKCX translational start site. These injections phenocopy the mutation entirely. 
The injected embryos are indistinguishable at the gross morphological level from the 
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genetic mutants (Fig. 3C), supporting the idea that loss of the C-terminal 70 amino acids 
is sufficient to eliminate gene function. 

The integrity of epithelia is affected by PKCX mutation, but not by freatment with 
5 concentramide 

PKCX belongs to the large PKC family of kinases and, with PKCt|, is classified 
as an 'atypical 5 PKC (Mellor et al., Biochem. J. 332:281-292, 1998). The presumptive 
oitholog of PKCX in C. elegans, PKC-3, colocalizes with Par3 and Par6 at the anterior 
pole of the one-cell embryo (Tabuse et al., supra; Hung et al., Development 126:127- 

10 135, 1999). PKC-3 is necessary for establishment of embryonic polarity, and 

inactivation of PKC-3 leads to mislocalization of the Par genes and a symmetrical first 
cell division. Drosophila possesses only one atypical PKC (DaPKC), which also 
associates with a Par3-lilce protein (Bazooka) and is implicated in control of cell polarity 
(Wordarz et al., supra). DaPKC mutants exhibit disordered epithelial layering, irregular 

15 cell shapes, and loss of epithelial cell polarity, believed to be due to defects in cell 

adhesion. In vertebrate cells, PKCX and PKC£ both localize to epithelial tight junctions 
and associate with a Par3-like protein (ASIP) (Joberty et al., Nat. Cell Biol. 2:531-539, 
2000; Suzuki et al., J. Cell Biol. 152:1 183-1 196, 2001; Lin et al., Nat. Cell Biol. 2:540- 
547, 2000; Izumi et al., J. Cell Biol. 143:95-106, 1998). We therefore examined whether 

20 the has mutation and concentramide treatment perturb epithelial patterning and tight 
junctions, focusing upon the retina and the kidney. 

The neural retina arises from an epithelial sheet that is bordered by the lens on 
the basal surface and by a second epithelial sheet (the retinal pigmented epithehum, 
PvPE) on the apical surface (Schmitt et al., J. Comp. Neurol. 344:532-542, 1994). Prior 

25 to cell differentiation, the nuclei of the neuroepithelial cells migrate between the apical 
and basal surfaces of the epithelium. During M-phase, cell nuclei localize to the apical 
surface, adjacent to the neighboring RPE (Sauer, J. Comp. Neurol. 62:377-405, 1935). 
Beginning at about 30 hpf, these neuroepithelial cells exit the cell cycle and differentiate 
into one of seven distinct cell types (Altshuler et al., "Specification of Cell Type in the 

30 Vertebrate Retina," In Development of the Visual System, Lam et al. (Eds.), The MIT 
Press, Cambridge, MA 37-58, 1991; Dowling, "The Retina," Belknap Press, Cambridge, 
MA, 1987). Each cell type then migrates to a specific layer in the retina, resulting in a 
highly organized, laminar pattern (see Fig. 4A). 
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The has mutation causes disruption of the layering of the neural retina and patchy 
loss of the RPE (Fig. 4E). These defects resemble those noted previously in zebrafish 
bearing the mutations oko meduzy (pme) and mosaic eyes (moe) (Jensen et al., 
Development 128:95-105, 2001; MalicM et al., Development 126:1235-1246, 1999). In 

5 has mutants, the severity of laminar disruption correlates with the position and degree of 
RPE discontinuity, suggesting that the RPE epithelial defect causes or exacerbates that 
of the neural retina. This would be concordant with the evidence that a normal RPE is 
critical to lamination (Raymond et al., Curr. Biol. 5:1286-1295, 1995; Vollmer et al., 
Neurosci. Lett. 48:191-196, 1984) and the fact that the retinal epithelium of has mutants 

10 manifests at least one attribute of proper apical-basal polarity in that the majority of the 
mitotic nuclei localize correctly to the apical surface of the neuroepithelium (Figs. 4B, 
4D, and 4F; 89% of M-phase nuclei from has embryos localize to the apical surface 
versus 97% of nuclei from wild-type embryos). As a marker of tight junctions, we 
examined immunoreactive zonula occludens (ZO-1), an integral tight junction protein, 

15 and find it to be mislocalized (Figs. 4G and 4H). Therefore, loss of adhesion between 
RPE cells may be a cause of retinal mispatterning in has mutants. Notably, retinas from 
concentramide-treated embryos do not exhibit defects in cell polarity (Fig. 4D), RPE 
continuity, or lamination (Fig. 4C) 

The developing kidney is another structure composed of highly polarized 

20 epithelial cells. We examined the distribution of apical and basolateral proteins in the 
kidneys of wild-type, has, and concentramide-treated embryos. As in the retina, cell 
polarity appeared to be largely conserved in has kidneys (Figs. 5A-5C). The has kidneys 
did, however, exhibit irregularities in the shapes of epithelial cells and occasional gaps 
between cells, consistent with a defect in epithelial cell adhesion. We did not observe 

25 these defects in embryos exposed to concentramide. 

Given the differences between has and concentramide-treated embryos with 
regard to epithelial sheet integrity in the retina and the kidney, it is unlikely that 
concentramide functions through the same mechanism as the has mutation, namely the 
inactivation of PKCL To examine this further, we tested the effect of concentramide on 

30 early development of the C. elegans embryo. In C. elegans, inactivation of the PKCk 
ortholog PKC-3 via RNA interference (RNAi) results in the loss of polarized 
localization of the Par proteins and loss of asymmetry during the first cell division 
(Tabuse et al., supra). Embryos treated with high concentrations of concentramide retain 
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proper localization of Par2 to the posterior pole and undergo a normally asymmetric first 
cell division (Figs. 5D and 5E). Treated embryos exhibit cytokinetic defects and fail to 
complete development, suggesting that the absence of an asymmetry defect is not due to 
problems with compound penetration. Therefore, although concentramide treatment and 
5 PKCA, inactivation both result in similar heart patterning phenotypes, concentramide 
does not appear to inactivate zebrafishPKCXor its nematode ortholog. 

Hie molecular target of concentramide is involved in AP patterning 

If the molecular target of concentramide does not affect the continuity of 

10 epithelial sheets as PKCX, does, by what sort of process might it influence heart 

patterning? Treatment with concentramide appears to affect the relative positions of 
several anatomical structures along the anterior-posterior (AP) axis. For example, the 
distance between Pax2.1 -expressing cells in the eyes and at the midbrain/hindbrain 
boundary is reduced in concentramide-treated embryos (Figs. 6A-6C). Perhaps more 

15 significantly, the cardiac myosin light chain 2 (cmlc2)-expressing cells of the heart field 
are shifted rostrally in concentramide-treated embryos at the 1 8-somite stage (Fig. 6D). 
The distance between the anterior edge of the cmlc2-expressing field and the anterior 
extreme of the embryo is about 40 percent greater in wild-type embryos (3.1 +/- 0.2 
arbitrary units, n=8) than in concentramide-treated embryos (2.2 +/- 0.3 arbitrary units, 

20 n=12). The position of the heart field in has mutants (3.1 +/- 0.3 arbitrary units, n=12) 
does not differ significantly from the wild-type position. Therefore, the molecular target 
of concentramide appears to play a role in AP patterning. 

PKCX and the target of concentramide both influence the fusion order of heart 
25 primordia 

PKd and the molecular target of concentramide appear to act via distinct 
cellular mechanisms, but modulation of either results in a very similar change in the 
patterning of the heart. To identify the commonalties between the two mechanisms that 
allow such similar nnspatterning of the heart, we took advantage of the temporal control 
30 with which small molecules can modulate biological processes. As described above, we 
determined that embryos must be treated with concentramide at or prior to the 14-somite 
stage to cause formation of the ventricle wilhin the atrium. From this observation, we 
conclude that a critical heart patterning process is initiated shortly after the 14-somite 
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stage, and perturbation of this process results in the concentric chamber phenotype 
observed in both has and concentamide-treated embryos. This allowed us to focus our 
search for commonalties between has and concentramide-treated embryos to this critical 
time period. 

5 The generation of the prirnitive heart tube is accomplished by midline 

coalescence of the bilateral cardiac primordial sheets. In the zebrafish, this coalescence 
first generates a single midline cone, with its base on the yolk (Fishman et al., supra; 
Yelon et al., Dev. Biol. 214:23-37, 1999). Subsequently, the cone tilts to assume a 
midline A-P orientation with the pre-ventricular end posterior, later to swing anteriorly 

10 as yolk is resorbed. 

We find that normally the generation of the midline cone does not occur 
uniformly around the cone's circumference, but rather progresses from posterior to 
anterior, with posterior regions merging at the 1 6-somite stage and anterior at the 1 8- 
somite stage. This step is perturbed by both concentramide and the has mutation. In 

15 both has mutant embryos and concentramide-treated embryos, there is a failure to merge 
the posterior ends (Figs. 7A-7C). Even by the 1 8-somite stage, when the anterior ends of 
the primordia begin to fuse normally, the posterior ends remain separated in the has and 
concentramide-treated embryos (Figs. 7D-7F). Eventually, the posterior ends do fuse in 
has and concentramide-treated embryos, just before emergence of the concentric 

20 chambered heart. Thus, a critical patterning decision occurs at about the 1 6-somite stage 
that regulates the fusion order of the anterior and posterior ends of the heart field. This 
process can be blocked either by inactivation of PKCX, or by modulating the target of 
concentramide. 

Thus, in summary, we have defined a key step in heart formation by its 
25 perturbation with a small molecule and a mutation. This step involves the proper 

alignment of the two cardiac chambers, just as the primitive heart tube assembles. Two 
perturbants -the small molecule concentramide and the has mutation-- both elicit a 
previously undescribed chamber malalignment, in which the ventricle forms inside of the 
atrium. This means that establishment of the cardiocyte cell fates is largely 
30 accomplished, but the higher order assembly of chamber structure is disrupted. 
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Experimental Methods 
Small molecule treatment 

Zebrafish were maintained at 28.5°C as described (Westerfield, "The Zebrafish 
Book, Guide for the Laboratory Use of Zebrafisb (Danio rerid)" Univ. of Oregon Press, 
5 Eugene 1995). Unless specified otherwise, embryos were treated prior to gastrulation by 
adding concentramide to the water at a final concentration of 34 nM from a 34 uM stock 
solution in DMSO. 



Whole-mount in situ hybridization and immnnohistochemistiy 
10 Digoxigenin-labeled antisense RNA probes were generated by in vitro 

transcription for cmlc2 (Yelon et al., supra), vmhc (Yelon et al., supra), andpax2.1 
(Krauss et al., Development 1 13:1193-1206, 1991). In situ hybridization was carried out 
as described (Oxtoby et al., Nucleic Acids Res. 21 :1087-1095, 1993). For whole-mount 
inrmunohistochemistry, embryos were fixed in 4% paraformaldehyde in phosphate- 
15 buffered saline (S46 and 3G8) or 80% methanol, 20% dimethyl sulfoxide (a-ZO-1), 
permeablized in acetone for 30 minutes at -20°C (3G8), blocked with 5% fetal bovine 
serum, and incubated with the antibodies S46, 3G8 (Vize et al., Dev. Biol. 171 :53 1-540, 
1 995), or a-ZO-1 . An anti-mouse-horseradish peroxidase conjugate was used as 
secondary antibody for S46 and 3G8, and an Alexa 488-labeled anti-mouse secondary 
20 antibody was used for a-ZO-1 staining. 



Histology 

Fixed embryos were dehydrated, embedded in plastic (JB-4, Polysciences, Inc.), 
and sectioned at 2-7 um. Retinal sections were stained with hematoxylin-eosin or dapi. 

25 

Cloning of has 

Embryos were separated into mutant and wild-type pools based on phenotypic 
analysis. Genomic DNA was isolated from individual embryos by incubation in DNA 
isolation buffer overnight at 50°C (DNA isolation buffer: 10 mM Tris-HCl, pH 8.3; 50 
30 mM KC1; 0.3% Tween-20; 0.3% Nonidet P40; 0.5 mg/ml proteinase K). Proteinase K 
was inactivated prior to PCR setup by heating samples to 98°C for 10 minutes. PCR 
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reactions were performed using diluted genomic DNA as described (Knapik et al., 
Development 123:451-460, 1996). 

RNA was isolated (RNeasy columns, Qiagen) from pools of wild-type and 
mutant embryos to generate cDNA for RT-PCR analysis (SMART RACE cDNA 
5 amplification kit, Clontech). Fragments were then subcloned into PCRII-TOPO 

(Tnvitrogen). PCR primers were synthesized based on sequence from an EST for PKCX 
(fc69h04, GenBank accession# AI883774) and genomic sequence (Genome Systems, 
BAC clone address 23cl4), and used to sequence the entire PKCk coding region and 
3UTR. 

10 Genomic clones were isolated by PCR analysis of DNA pools from BAC 

(Genome Systems) and YAC (Research Genetics) libraries using primer sets for the 
linked markers zl 1023 and z8451. YAC end sequence was determined as described 
(Zhong et al., Genomics 48:136-138, 1998). BAC ends were sequenced directly using 
SP6 and T7 primers, and BACs 53cl7, 89il5, and 152p21 were subcloned by shotgun 

15 cloning of partial AM digested fragments into pBluescript. For the complete sequencing 
of BACs, a hydroshear was used to produce fragments of 2-3kb in length. These 
fragments were then blunt-end ligated into pGEM5 (Promega) and sequenced using an 
ABB 700 to generate approximately five-fold coverage. The sequence was assembled 
using the Phred/Phrap/Consed programs (Gordon et al., Genome Res. 8: 195-202, 1998; 

20 Ewing et al., Genome Res. 8: 186-194, 1998; Ewing et al., Genome Res. 8:175-185, 
1998). 

Western blotting 

Groups of 25 embryos were lysed in 0.5% Triton XI 00 in phosphate-buffered 
25 saline. Lysates were clarified by centrifugation and separted by 10% sodium dodecyl 
sulfate polyacrylamide gel electrophoresis. Western blotting was performed using an a- 
PKCX rabbit polyclonal antibody (Santa Cruz Biotechnology, Inc.). 

Morpholino injection 
30 An antisense morpholino oligonucleotide of sequence 5'- 

CTGTCCCGCAGCGTGGGCATTATGG-3 ' (GeneTools, LLC) was dissolved at a 
concentration of 100 yM in IX Danieau's buffer (5 mM Hepes pH 7.6, 58 mM NaCl, 
0.7 mM KC1, 0.6 mM Ca(N0 3 ) 2 , 0.4 mM MgSQ 4 ). One nL of this solution or IX 
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Danieau's buffer was injected into each 1-4 cell embryo before allowing the embryos to 
develop at 28.5°C. 



C. elegans development 
5 C. elegans strain KK871 (par-2::GEP) was maintained at 25°C. For each sample, 

10-15 adult worms were soaked in 80 U.L M9 medium containing 34 (xM concentramide, 
0.25% dimethyl sulfoxide for 30-60 rninutes. Worms were then cut open with a scalpel, 
and embryos were mounted on 2% agarose pads with coverslips. Embryos were allowed 
to develop at 25°C before being photographed live. 

10 

Other Embodiments 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each independent publication or patent 
application was specifically and individually indicated to be incorporated by reference. 

15 While the invention has been described in connection with specific embodiments 

thereof, it is to be understood that it is capable of further modifications and this 
application is intended to cover any variations, uses, or adaptations of the invention 
following, in general, the principles of the invention and including such departures from 
the present disclosure that come within known or customary practice within the art to 

20 which the invention pertains and can be applied to the essential features hereinbefore set 
forth, and follows in the scope of the appended claims. 
What is claimed is: 
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Claims 

1 . A method of determining whether a test subject has, or is at risk of 
developing, a disease or condition related to Protein Kinase C X, said method comprising 
analyzing a nucleic acid molecule of a sample from the test subject to detennine whether 

5 the test subject has a mutation in a gene encoding said Protein Kinase C X, wherein the 
presence of a mutation indicates that said test subject has, or is at risk of developing, a 
disease or condition related to Protein Kinase C X. 

2. The method of claim 1, wherein said test subject is a mammal. 

10 

3. Hie method of claim 1, wherein said test subject is a human. 

4. The method of claim 1, wherein said disease or condition is a disease or 
condition of the heart or cancer. 

15 

5. The method of claim 1, wherein said disease or condition is associated with 
epithelial-epithelial cell interactions or epithelial cell polarity. 

6. The method of claim 1, wherein said mutation results in a carboxyl tenxiinal 
20 truncation of Protein Kinase C X. 

7. The method of claim 1 , wherein said mutation is the heart and soul mutation. 

8. A method for identifying a compound that can be used to treat or to prevent a 
25 disease or condition of associated with Protein Kinase C X, said method comprising 

contacting an organism comprising a mutation in a gene encoding Protein Kinase C X 
and having a phenotype characteristic of a disease or condition associated with Protein 
Kinase C X with said compound, and determining the effect of said compound on said 
phenotype, wherein detection of an improvement in said phenotype indicates the 
30 identification of a compound that can be used to treat or to prevent said disease or 
condition. 
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9. The method of claim 8, wherein said disease or condition associated with 
Protein Kinase C X is heart disease. 

1 0. The' method of claim 8, wherein said organism is a zehrafish. 

11. The method of claim 8, wherein said mutation results in a carhoxyl terminal 
truncation of Protein Kinase C X. 

12. A method of treating or preventing a disease or condition associated with 
Protein Kinase C X in a patient, said method comprising admimstering to said patient a 
compound identified using the method of claim 8. 

13. The method of claim 12, wherein said disease or condition is of the heart. 

14. The method of claim 12, wherein said patient has a mutation that results in a 
carhoxyl terminal truncation of Protein Kinase C X. 

15. A method of treating or preventing a disease or condition associated with 
Protein Kinase C X in a patient, said method comprising administering to said patient a 
functional Protein Kinase C X protein or an expression vector comprising a nucleic acid 
molecule encoding said protein. 

16. A method of treating or preventing a disease or condition associated with 
Protein Kinase C X in a patient, said method comprising administering to said patient a 
compound or molecule that alters the activity or expression of Protein Kinase C X in said 
patient. 

17. A substantially pure zehrafish Protein Kinase C X polypeptide. 

1 8. The polypeptide of claim 17, wherein said polypeptide comprises an amino 
acid sequence that is substantially identical to the amino acid sequence of SEQ ID NO:2. 
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19. The polypeptide of claim 17, wherein said polypeptide comprises the amino 
acid sequence of SEQ ID NO:2. 

20. A substantially pure polypeptide comprising the sequence of SEQ ID NO:2 
5 and variants thereof comprising sequences that are at least 95% identical to that of SEQ 

ID NO:2, and which have Protein Kinase C X activity. 

21. An isolated nucleic acid molecule comprising a sequence encoding a 
zehrafish Protein Kinase C X polypeptide. 

10 

22. The nucleic acid molecule of claim 21, wherein said nucleic acid molecule 
encodes a polypeptide comprising an amino sequence that is substantially identical to the 
amino acid sequence of SEQ ID NO:2. 

15 23 . The nucleic acid molecule of claim 2 1 , wherein said nucleic acid molecule 

encodes a polypeptide comprising the aroino acid sequence of SEQ ID NO:2. 

24. An isolated nucleic acid molecule that specifically hybridizes under high 
stringency conditions to the complement of the sequence set forth in SEQ ID NO:l, 

20 wherein said nucleic acid molecule encodes a protein that has Protein Kinase C X 
activity. 

25. A vector comprising the nucleic acid molecule of claim 21 . 

25 26. A cell comprising the vector of claim 25. 

27. A non-human animal having a knockout mutation in one or both alleles 
encoding a Protein Kinase C X polypeptide. 

30 28. A cell from the non-human knockout animal of claim 27. 

29. A non-human transgenic animal comprising a nucleic acid molecule 
encoding a mutant Protein Kinase C X polypeptide. 
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30. The non-human transgenic animal of claim 29, wherein the non-human 
transgenic animal is a zebrafish. 

5 3 1 . The non-human transgenic animal of claim 29, wherein the non-human 

transgenic animal comprises the heart and soul mutation. 

32. An antibody that specifically binds to a Protein Kinase C X polypeptide. 

10 33. A method of modulating the activity of a Protein Kinase C X polypeptide in a 

patient, said method comprising administering to the patient an RNA that stimulates or 
inhibits this activity. 
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SEQUENCE LISTING 

<110> The General Hospital Corporation 

<120> Methods for Diagnosing and Treating 

Diseases and Conditions Associated with Protein Kinase 
Clambda 

<130> 007BS/40SWO2 

<150> US 60/317,653 
<151> 2001-09-06 

<160> 5 

<170> PastSEQ for Windows Version 4.0 

<210> 1 

<211> 3437 

<212> DNA 

<213> Danio rerio 

<220> 
<221> CDS 

<222> (143) . . . (1906) 
<400> 1 

gccaggctgt ttatttaacc ggagacggca ctattgctgt ccaaagaata cgtttagttt 60 
taaaactccg gtagtttttc ctcgtcagac gatagctggc tagcatcatt agctaagcta 120 
gcaggagtac ggatagtcca ta atg ccc acg ctg egg gac age acc atg tec 172 

Met Pro Thr Leu Arg Asp Ser Thr Met Ser 
15 10 

cac ccc gga gaa aac ccg cac caa gtc egg gta aaa gec tac tac aga 220 
His Pro Gly Glu Asn Pro His Gin Val Arg Val Lys Ala Tyr Tyr Arg 
15 20 25 

ggg gac ate atg ate aca cat ttt gag cct teg ate tec tat gag gga 268 
Gly Asp He Met He Thr His Phe Glu Pro Ser He Ser Tyr Glu Gly 
30 35 40 

etc tgc aat gag gtg cgt gat atg tgc tec atg gac aat gac cag etc 316 
Leu Cys Asn Glu Val Arg Asp Met Cys Ser Met Asp Asn Asp Gin Leu 
45 50 55 

ttc acc atg aaa tgg att gat gag gaa ggg gat ccg tgc acc gtt tct 3 64 
Phe Thr Met Lys Trp He Asp Glu Glu Gly Asp Pro Cys Thr Val Ser 
60 65 70 

tct cag ctg gag ctg gag gag gec ttg cgt eta tat gaa etc aac aaa 412 
Ser Gin Leu Glu Leu Glu Glu Ala Leu Arg Leu Tyr Glu Leu Asn Lys 
75 80 85 90 

gac teg gag etc att att cac gtg ttt cct tgt gtc cct gaa aaa cct 460 
Asp Ser Glu Leu He He His Val Phe Pro Cys Val Pro Glu Lys Pro 
95 100 105 



-1- 



ggc atg ccc tgt cct gga gaa gac aag tct ata tac egg egg gga get 508 
Gly Met Pro Cys Pro Gly Glu Asp Lys Ser He Tyr Arg Arg Gly Ala 
110 115 120 

cga cgt tgg agg aaa etc tac tat gee act gga cat gcg ttt cag gee 556 
Arg Arg Trp Arg Lys Leu Tyr Tyr Ala Thr Gly His Ala Phe Gin Ala 
125 130 135 

aaa cgc ttt aac agg cgt get cat tgt gee ate tgc aca gat cgt ate 604 
Lys Arg Phe Asn Arg Arg Ala His Cys Ala He Cys Thr Asp Arg He 
140 145 150 

tgg ggt °tg gg c a gg ca -g gg a tac aa g tgt a tc aac tgt aag ctt ctg 652 

Trp Gly Leu Gly Arg Gin Gly Tyr Lys Cys He Asn Cys Lys Leu Leu 
155 160 165 170 

gtg cat aag aaa tgc cat aag ctg gtc aca gta gaa tgt ggt aga cag 70 0 
Val His Lys Lys Cys His Lys Leu Val Thr Val Glu Cys Gly Arg Gin 
175 180 185 

gta ata cag gac cca atg ate gga aga ate gat cca ggg teg act cat 74 8 
Val He Gin Asp Pro Met He Gly Arg He Asp Pro Gly Ser Thr His 
190 195 200 

cca gag cac cca gat caa gtt ctg ggc aaa aag aac tea aca gaa age 796 
Pro Glu His Pro Asp Gin Val Leu Gly Lys Lys Asn Ser Thr Glu Ser 
205 210 215 

ate aat cat gag gga gag gag cat gag get gtg ggc agt egg gaa tea 844 
He Asn His Glu Gly Glu Glu His Glu Ala Val Gly Ser Arg Glu Ser 
220 225 230 

gga aaa gcg gtg tec agt ttg ggt eta ata gac ttt gac ctg ctg cga 892 
Gly Lys Ala Val Ser Ser Leu Gly Leu He Asp Phe Asp Leu Leu Arg 
235 " 240 245 250 

gtg att ggc agg ggc age tac gee aaa gtt ctg ctg gtg cgt etc aaa 940 
Val He Gly Arg Gly Ser Tyr Ala Lys Val Leu Leu Val Arg Leu Lys 
255 260 265 

aag aca gaa cgc ate tat gec atg aag gtg gtg aag aag gag ctg gtc 988 
Lys Thr Glu Arg He Tyr Ala Met Lys Val Val Lys Lys Glu Leu Val 
270 275 280 

aac gat gac gag gat att gac tgg gtt cag act gaa aag cat gtg ttt 103 6 
Asn Asp Asp Glu Asp He Asp Trp Val Gin Thr Glu Lys His Val Phe 
285 290 295 

gag cag get tea aac cat ccc ttc ctt gtg gga ctt cac tec tgc ttc 1084 
Glu Gin Ala Ser Asn His Pro Phe Leu Val Gly Leu His Ser Cys Phe 
300 305 310 

cag acg gag age aga ctg ttc ttt gta ate gag tat gtg aat gga ggg 1132 
Gin Thr Glu Ser Arg Leu Phe Phe Val He Glu Tyr Val Asn Gly Gly 
315 320 325 330 

gat etc atg ttc cac atg cag egg cag agg aaa ctt ccg gaa gag cac 1180 
Asp Leu Met Phe His Met Gin Arg Gin Arg Lys Leu Pro Glu Glu His 
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335 



340 



345 



gcc agg ttt tac tct gca gag ate agt ctt gec ttg aac tac etc cat 1228 
Ala Arg Phe Tyr Ser Ala Glu lie Ser Leu Ala Leu Asn Tyr Leu His 
350 355 360 

gag cgt ggc att att tac agg gac ctg aaa ctg gac aat gtt ctg ctg 1276 
Glu Arg Gly lie lie Tyr Arg Asp Leu Lys Leu Asp Asn Val Leu Leu 
365 370 375 

gat tea gag gga cac ate aaa etc act gat tac ggc atg tgt aag gag 1324 
Asp Ser Glu Gly His lie Lys Leu Thr Asp Tyr Gly Met Cys Lys Glu 
380 385 390 

gga ctg aga cca gga gat aca ace age act ttc tgt gga act ccc aat 13 72 
Gly Leu Arg Pro Gly Asp Thr Thr Ser Thr Phe Cys Gly Thr Pro Asn 
395 400 405 410 

tac att gca cca gag att ctg aga gga gaa gac tat ggt ttt agt gtg 1420 
Tyr He Ala Pro Glu He Leu Arg Gly Glu Asp Tyr Gly Phe Ser Val 
415 420 425 

gac tgg tgg get ctg ggc gtc ctg atg ttt gag atg atg get gga aga 1468 
Asp Trp Trp Ala Leu Gly Val Leu Met Phe Glu Met Met Ala Gly Arg 
430 435 440 

tct ccc ttc gac ata gtc ggc age tct gat aac cct gac caa aac aca 1516 
Ser Pro Phe Asp He Val Gly Ser Ser Asp Asn Pro Asp Gin Asn Thr 
445 450 455 

gag gat tat ctt ttc caa gtc att ttg gag aag cag ate aga att ccc 1564 
Glu Asp Tyr Leu Phe Gin Val He Leu Glu Lys Gin He Arg He Pro 
460 " 465 470 

aga teg tta teg gtc aaa gcc gca age gtg ctg aag gga ttc etc aac 1612 
Arg Ser Leu Ser Val Lys Ala Ala Ser Val Leu Lys Gly Phe Leu Asn 
475 480 485 490 

aag gag teg aag gaa egg ctg gga tgt cat cct cag aca ggc ttc gca 1660 
Lys Glu Ser Lys Glu Arg Leu Gly Cys His Pro Gin Thr Gly Phe Ala 
495 500 505 

gac ate atg gcc cat cct ttt ttc cga aat gta gac tgg gat ctt atg 1708 
Asp He Met Ala His Pro Phe Phe Arg Asn Val Asp Trp Asp Leu Met 
510 515 520 

gag cag aag caa gta gtt cca ccg ttc aaa cct aac ate teg ggc gag 1756 
Glu Gin Lys Gin Val Val Pro Pro Phe Lys Pro Asn He Ser Gly Glu 
525 530 535 

ttt ggt ctg gat aac ttt gat gcc cag ttc acc aac gag ccc att cag 18 04 
Phe Gly Leu Asp Asn Phe Asp Ala Gin Phe Thr Asn Glu Pro He Gin 
540 545 550 

etc acg cct gac gat gat gat get gta aag aag ate gac cag tct gag 1B52 
Leu Thr Pro Asp Asp Asp Asp Ala Val Lys Lys He Asp Gin Ser Glu 
555 560 565 570 
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ttt gaa ggc ttc gag tac ate aac cct ctg ctg atg tct gcg gag gag 1900 
Phe Glu Gly Phe Glu Tyr lie Asn Pro Leu Leu Met Ser Ala Glu Glu 
575 580 585 

tgt gtg tgaaeggteg ctttatccct ctgttactcg catatcatcg ctgectttat 1956 
Cys Val 



ttgcatggtc gcaatcaatc acacgaaagg aagcaacaag aacctgactt tgctttgttg 2016 
ggaccagatg aaacagtaac tttgccaaat gtctttcact ttctgecatt tgtaaccact 2076 
agtccttaag tgtctatttt tttctcaatt atttttgtat catgttaatc agcagcactg 2136 
atgaaaggac atttgtcagt gccttcgacc aacagtttta getttcegga ctctgcaaac 2196 
taaaggaaaa aagaatgact gtgatggtac gcaggacctc ecaatgetaa agatatgeat 2256 
tttattttgt aaatatgaaa gagaatcctt tgagcatata tagtaageca ttttaaaact 2316 
ctataccaca tgggatattc ttgaagaaag tttctgatta tctgttttct gtagcgtaag 23 76 
gatgagaaca ctttgtttta ttactatatt tttatttaag agtactgtca tctatataga 2436 
atgtgcacaa tgtgttgaat cagagttttc cagaagttgt tttaagaegg ttggacttgt 2496 
ttcctcgttt tagattgaag attgattgga gcagggaaca ttattgaaca ctgttgtaga 2556 
tacttacaac tgtgaatgga ggagacattt tctgtataga gaggtgaaaa cacaacagct 2616 
ttcttcaatg caggaccaaa ataagacact aaaatgagtg ttcctcttgg cgatctccaa 2676 
acagacgagg taaacgcatg ttactactct aactgeagea tgtataaact atttctcget 2736 
ttcttgtttg atttcttget ctttttcttg tgttaaatgt tatatattgc ctttctggtt 2796 
atgatattcc gttgatgtat ttttgcattg aacaaactga gcatcggtga gcattgtttt 2856 
tcgatacagt cacegtaaag tggcttcttt cagccctttt ggggatttca agectgatea 2 916 
gatgeatgat gaggtttgtg tttactccac acggcgcccg gtttttgggg tgatgcgttt 2976 
tttttaaaca tcatgtctgg acgtgttttt tgtttgtgga ctaaactgaa aggacctttg 3 036 
accataatga ccaaatgatg acattaaaca ggctactcgt atgeagcate accctctctc 3096 
attccactcc atgeacgett caactcgctt actatttcac agatgttcac accggtttgg 3156 
agetgegagg atetegttag caacccggcg ttagaaatga ttgaatcget taaggccctc 3216 
gatgeattec cagaaaaaag aaaatgatgt getaatatge tttaaagaag categggggg 3276 
ctgaattgga cctgtttttt tcttctctct gtatgttttg tgtattaata tgcacactga 3336 
aaaacactat caactgactg gaataataaa ctgtaccact tattttgtta acacctcatt 3396 
aaagtattta agaaaatctc aaaaaaaaaa aaaaaaaaaa a _ 3437 
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<400> 3 

aagctttaca ttgaatagag cagaaggaaa 
tttttatttt tatatataca taagtagcct 
tccgattaaa catttgtatg tattattagc 
catccggaaa gtactcatag cgcttcactt 
taaatggatt aaattattgt ctcaacattc 
tttttttttt aattgttgca aatttattca 
taagtattta cagcctttgc cgtgaagctc 
ctgatcattc ctgagatttt tcagcagctt 
gatttgaaaa cgcatacacc tttctatata 
caaagcacaa accaagcatg aagacaaagg 
agcagatttt tttaaaaaca caaagacatg 
tactgagaaa tctcccaggt caaaaatcct 
taactcttct tcagactgtg tttctctctc 
cactcgagcc agatcaataa ttctcctcaa 
tcagtggcct ccgctcccct tatgtaaaca 
ttgagggctt gcgttgcatc tactggagtc 
ctcaacaccc tccatcaaac agacaaaacc 
acccttcaat atcttaccaa acacatgctg 
catgctaagc aaattcctga aaatgccatg 
aaagcagaaa tctttgtaaa actggactga 
aaagaaaaaa aaattaggaa gttaaattca 
tgttatttgg ttggaaagtg acttcagcgt 
ataatattta gtatgtgtat tttgttaata 
tgtgtgtggt atatggtatt aggcagtggt 
taaagatatg attatatttg ttgaagcttt 
gtcttgctta aatttaccat taaaaggtcc 
tcaattgcct ggaccgaacc taagttccat 
ttgtgttcac acagtctttt ttccttctga 
aataactaag cggcgacact agtgtgattt 
gcggccattt tggaatgaaa attccaatag 
aaataaacta ttaaaagtgt tgatgattgt 
gttcaatgtt aatgcgcttt ttaaataaat 
gcgacagcaa aaagaacaaa tcgatggacg 
aggaggatat ctaagttagt gtgcccaaaa 
aagctgatat aaacatgtaa agcttgtaat 
cctttgttgt tttacaaagg ggaggagcta 
gttgaaataa cgtcaaacat caaataacaa 
cagtttcact tttaattaca attatcatca 
actagttcaa gggtggtagg actgggcgat 
aaaccactaa ttgttttaca catttttgaa 
taaaatgatc atttattcaa aaagattcaa 
ggttatttta tttagaagag atgctgctta 
aataattcat tttgtttcca aaagagcaag 
ttaggcaatg tgtgttttaa tttcagttgt 
attgtagctt gtgcttcgtt cacttatttt 
atctcttgtg atatgtgagc ttatttaccc 
aaataaaata aataaacgaa ataatcgttc 



gagcatgtcc tcactcggac acctgccagg 60 
aaatatgcac ttgtcatctg ttaaaattca 12 0 
atttcgacat ttatctgcgt atatacagtg 180 
attccacatt ttttgttaca gatttattcc 240 
tacacacaat agcccataat gacaatgtga 3 00 
aaataaaaaa cctgaaaaat cacatgtaca 3 50 
taaactgagc tcaggtacat tttgtttcca 420 
aattggagat cacttgtggt aaatttagtt 480 
aggtcctata taagggttga tagtgcttgt 540 
acctgtcttt ccggatgcac tgtagctgtg 600 
catttcatct tcaaacttgg aaactctgcc 660 
ggatcatata ctgatgataa ttatcgggtc 72 0 
tctttccaac acacagttgc tgaattcatc 78 0 
tcactctcac tctctcccat aaacacacac 840 
gagactgatt aggtcattct gctgcttatt 90 0 
tctgatattt acactggagc agacctggag 960 
gcgaggctct tccacctctg acctttcatt 1020 
gatgaagata aagctccaaa gacacaatgt 1080 
tttgcagagg tcagacaggg agtataaaat 1140 
aggcacagtc atattaatcc ttttctgcta 1200 
gtaatcttaa caggaaatca tgcaatcagg 1260 
gtgcgatgct ttatttactg ctttttgaat 1320 
actgaacaag tacacaaata catgatggta 13 80 
tagaaatcaa ctgtaagaaa catctgatca 1440 
acaattcaac gctacttcaa gttacatctt 1500 
aatgagatta agcgggcttt tacacttggt 1560 
ttgcccccct ttgccacctt ctctgctggt 1620 
atccaggtac acttgcatca aagagcatag 1680 
gggaaactcg ccgcggcatc tagtgtctca 1740 
aacaagtctt tagcatatta taagtctgta 18 0 0 
gttagtaagt gttatattgc catctttcag 1860 
aattaaaaaa caaagcagct gcttgccatc 192 0 
attacttttt ctccaaaatg gcagaatagt 1980 
cagtgacaaa atttgcattt agaagatata 2040 
ttgtcacttt cacatatata gagcaaattt 2100 
atccatgtcc cactctctct tcgtgttttg 2160 
tgcacgtttc gaaacaattc aagtcagtgt 222 0 
ccatatatcc agtcaaactg gaggaccaga 22 80 
taatcgttaa gtagtcgaaa tcaacattta 2340 
tgccttgaga cagcatattt tccattacaa 2400 
aacttaaata ttatttacag gatacagata 2460 
ttttatactt tgaatatgaa aaacgttgaa 2520 
ttatttattt atctttattt tttataagtt 2580 
tcaacgttga agttcaataa ataatcagag 2640 
gaaatcaagt aatgcgccct tcattcaaaa 2700 
tataaaactt aaggaactat ggaggacaaa 2760 
atcaatcgta atagagttaa aatgttcaat 2820 



BNSDOCID: <WO 03023048A2J_> 



WO 03/023048 



PCT/US02/28410 



taatcaacat ttgaggccaa atcgcaccag ccctaacaag tggtgtgtct cacatgacta 2880 
aatggaaaca gcaaagcaaa gcaaagcaaa aaaaaaaaag gaaatgcagc tattagatag 2940 
gaagcaaaaa atcaaggggg gtaaaaaatc tgacaacgaa ggtatgtcag agatgaggtt 3 000 
tgcggttcga gctacaggat ggcagaacaa aatacattag gcaggaaaat gacaggtttt 3 0 SO 
ctttcacaga tttaaaatgg aaggaaaaaa aactgaaagt ggaaataaga gcacgatagt 312 0 
ggaagaatta gcctcctcgc ctgaggtgca tgaagtcact gatttcacaa ttacaaaagg 3180 
aagacagaga gacgcaaacc gagaaagaga gagcgaaaga ggaagagaga gagaggcagc 3240 
tgtctggtgt ctgcaccgca gtcgctgtct catcctctgt gggagatcga caggactcca 33 00 
tctgtctgcc agacacgtct acacggcaca ccagccctca aaccattaca gctgcatgtc 33 SO 
tatcactgtc gtacaatcaa acaaacacac atgagctaac tgtatgccac ctaagtggga 3420 
agactaggac agctaaatta atgggatcat gaaaactcca agtacaggca catgcccaaa 3480 
tgactgcgtt cacaccaaag gcgtcgagag catccaagta gctggaaatc attctttttc 3540 
aatgggagcc tgcagcgata agcgacaagt cttcacggtt gtgggcgtcg aggagagttg 3600 
aaataaagtc aacgttatag taatgtgctg tgatgcagtt cagtggcaac caatcggaac 3660 
aagtcatgtt caagtccacc gcttgagagg agtccagaga acacagtcct gtgaactttg 3 720 
gttctgacca cagttgttcc caagggtttg attattgcgg ttactggatt tccaattatt 3780 
tacaaagttt tcttttagga aatgcgtgat gcgcattaat tttttggttt tttcccttta 3840 
aaaaaaaagc aggtatctca tgctaactac aacaacttaa tattttcgac atatggacac 3900 
atattggata agtggagcaa aaccacacca catgaaacaa cttcattttt agttgttaaa 3960 
ttaatttgat aaaagggaca caacactttc acgctagaaa gcatgttctt ttaatgtggg 4020 
attgtaactt taaatacaag attaatgtag tgaattctcg cggccggagc tgtaaagtca 4080 
gacagcgttg tcgccagggt ggccagagtg aactttgaca cactcgccac cctcggtgtg 414 0 
aatgcacagt gacttgcatc gtttttgaca agtgccaagc agactactga aatcatttta 4200 
gaaagaaatt ctaaaatcaa tctaaatata aactggggtg agaaagctaa atatatttgt 4260 
gtttggtaaa taggaaaaaa aagcagtgcg acaagtgttc caatagtgtt tcttgtcagg 4320 
tggtccctat atagcatatg agccttaacc actcgtgctc tattgacctt attgtaaaat 4380 
gcggaagtgc gcctgttttg acgattgttt tagaaattac gattcagtcg cctatgggag 4440 
aaatgactag gaataataaa cggcagaaaa cgatcaaact acttgctcta caaacaaatg 4500 
tttgcatgac tatacagacc aagtagaata atataataag aaaatatcag gtttgcaaca 4560 
tcaaacagcg aaacgagcag tttttaatgt ctaaaaataa atggaagtga atgagactgg 4620 
aagtctcaag ccaaaaagat tcaaatggct gcgccactcg tcggccaaga ataaggtgaa 4680 
tagacaccct aaccgaatac agcgcttgac aacacaaata aatacagact tacaaaaaca 4740 
caaaaactgt ccaagaaggt tcctgcgctg aattctctgg agcagcagct gtgcatcaaa 4800 
acctagccag gcagcatctt taagcatgca catttcacca aaatgaatac aaaaataact 4860 
ggacatctct taagttctaa aactaaatat tttttacttc attattattt acagcaaaat 4920 
cctctaagta aaattcactt agtttagata gtatttgctt cttctctaaa ttaaattaaa 4980 
gttatacaag ttaatgagac aacgaaagga ttaataaggt gatgattgtg cactgatgat 5040 
gaacacctgc tgttaataaa caaacacaaa gagaaacaca aaactacaac tgacttcagt 5100 
cacagccttt gataaaatca actgaaatat aaaagattga atctctcaaa atctcagcag 5160 
aggatcatta agcaactcaa caaagagcaa ctttacttat aattgtcctt tgagggacca 5220 
aatgctctct gaactgagta caatatactc agaggaattt gctgtgtaat tgtaattaat 5280 
atgttattat tgttgctatt tgagggatat ttaaatcacc aattttctca gaaatatgct 5340 
ccattttgta gttttactaa gctttttgat aatgctctga aaacaaatat atatatatat 5400 
atatacttat ataaataata agcaataaaa tcttaaaata tttcaggcat gtttcaaaat 5460 
ctagcgatct gtcaacctag gttgaattta agggccttat agactatgcc accagaatat 5520 
gccacctcct ctggtcaact cactagtttt aaaaaaccca gccagtgttt ccaaaagcac 5580 
agcagacagg ggagtcagtg tccttgataa agtggaagag gactgcatcc gagcagactg 5640 
aagtgccgtg tgctgaagtc ttgttatgag ggtctgatcg tcaccaacat ccacgcggac 5700 
acctgtggaa actctgagtc acttgttctg ccgctccggc atcacagatg tttagagtta 5760 
acatttcacc cagtaaactg atgagcctgt ctgccctgct tcccgtacac acacacacac 5820 
acacacacac agacgcccgc ataacagtga cttcttgatt tggacaccag cagaagagta 5880 
tgcatatgtt gtttctcttt agcggctaga cccgtgtaag ctcagcgtct aggtaatgac 5940 
ctttcacgta agcacatatg cctgtttacg gttcctccaa aacagtttaa caaatcttgc 6000 
aagcgcctgt gtacttgaca ccacttcaat cttaagcagt gattgttaat ttaaaagtga 6060 
ggaaaaaagc gagatacaat ttatgtttta aaaacttttc tgccttggca ataaactgga 6120 
acaagtaagt tgaagacaag tttagcagtt gcaaatttta gttggaagtt attaatagaa 6180 
tatttttgaa cattgtttta gtaatctgtg aattaatagt aattgtgaag ttaagaaatg 6240 
ctttattttt attttagtta gtatatttac tttaaaagca aagcaaagat gggaaatatt 63 00 



-7- 



agcttaagtg tttttgcaat cttattttta 
aaacttatca gatacctaga atttaataaa 
gacgataacc attttcaagg tataacacgt 
tccagcagaa aatatatcca agatgccgtt 
agaaacaaat taaaacagaa gtcaaggaat 
tgctctaaaa tattttaaat gtctcttaaa 
aaaaaaaaac tttcagtttt tacccagaca 
cacaatacag tcaaagcgtg atgtttttat 
ccggcccatg cctaattaca aggaaaacat 
taatttttaa aacaacatat atttgtgtat 
gaacatgtaa aacacaataa aaggtattgt 
ttttccactg aacgcaaaaa aatacatttt 
tcacttctat agcaggaaaa acaaatacta 
atctacaaaa aaaaaagttc cactgttttt 
cccctcacaa atctatcttt caaatttata 
ttatgcataa tacattagat tagtcagtac 
ataacttacg ataatgatac aaaaacttgt 
ttaaatacaa attttaaaat gaggaaaaat 
ttgatatatt gtagtttgta catttttttt 
gtattatctt tcaattacta aattggtttg 
ctgtttaata aatctgtttt gtttaaatgc 
tgtgaataaa attattgatt gtcaaaatgg 
aaataaacta aagacaggtg tgaaataagt 
ttttgggtga actatccctt tttgggtcat 
gcttacagga agtaaaaaaa aaaaaggaaa 
gccagacagg gacgtgtcgg gagttcggca 
tcagtgactc actcgtgttt ctgggagttt 
gacgctctca ggggaagttt agcgtgtgcg 
cagaagtgag ccgagagtta cagtgtagct 
tgtgtagcag atgtgactgt atgactgaca 
tgtgttgggg gagggtggaa ggaggaatgc 
tatttccggc acacttgcat aagcacacag 
agacagaaca caatcatttg ctctcgtcgg 
acattacagc acagttaaac cattaactgc 
actctggcag caccgacagg gggtcaatct 
agctctggct fcggtcaagtg ggtcatgaga 
agatcttttt ttgcactagg ttgacctttc 
tgagttttgt ttcaaaggtt gcattatgaa 
gattttgcaa ttacacaatg caggaatttc 
aaaatataga actttatttg tcaacactga 
attgtagccg ttgcagatta ggcatgggac 
ttttactttt ttgggaaaac agtatctcca 
aatttaaaat gtaaataaat tttaagagta 
gaactattta aggctgattt atacttctgc 
cgcatagccc ttcaccgtgg ccgttggcga 
tacacgtggc aatgacgctt agcgcaagct 
agtctgggtg ggaccgagag ccgcgcaaat 
agtgtggagt tccgtgaagg agctccggat 
ttaaagttgt tgtacgtccg ccagttcctg 
acatccccga agcgttcagg aaaagcaaaa 
tacacctcac tgccaactag cgtttcggaa 
cagacatata aatacacagc tacgcgcgtt 
acgcagaagt ttaaaccagg ctttagccta 
gtttctgaaa acaaaatata ttggctaaaa 
aatttaaatt tacaacccga caaaaacgga 
gaatggagct tggacgtagt acagatgcca 
cactcccatt cataaattct ctaacggtgc 
cacacttcag aatctcaccg cactctaggt 



aatattttaa tttaatctta cttaatttgt 63 60 
attttattgt tgtatcacaa ataggcatga 6420 
ttggaaattt ttttttagga caacagtatc 6480 
ttaaatggta aagggaagaa acaaagtttt 6540 
catttgaatt atttaccctg acatgtttac 6600 
atataatata atataatgta ttcaaaggag 6660 
tttaaaaaga acgtatttta gagcagtaat 6720 
ctaaggttat cataccgtta gaatcttata 6780 
agcgtgccga taaattgtaa aaatcataaa 6840 
gtaagatcat aacgttaaaa aactgtgtta 6900 
ttatatgaaa atgaaacctt tttttttttt 6960 
caagaaattt tcaagaaaac cttaaaccat 702 0 
tagaagtcaa tggttacaag gttttcagat 7080 
aatatacagt gctcagcata tataactaga 7140 
attttaatat aaagctatac aatattatat 7200 
tgaagccaaa tttggaaatt atctaacaaa 7260 
acacccaaac acatatgtta tagagaaaca 732 0 
caagagaaga aaaaaaattg aaaaaagttg 73 80 
ttttgcattt ttttttgctt gaatttaact 7440 
gtgactaaaa tactatttta ataaacattt 7500 
accaaaatac attgcctata ttcaccgaga 7560 
ggtctactca tttatgctga gcagtgtaaa 762 0 
gaagggtgag taaataagga cagaatacat 768 0 
tgcatcttaa aataaaactc aaagttaaga 7740 
aaaaaagcat gaagtcaggt gcaagacact 7800 
gttaacagct cgctgtgcaa agagcaggtg 7860 
gatatctcac cctcagctga gcggagcaga 792 0 
tgtgaatcaa tttcattagt gtgcgttaag 798 0 
ggctgtttgt gtgtcagggt gaggtcagtc 8040 
gactcaaggt ctgttttagt acatgaagtg 8100 
ggtttgggtg ctagacgaga caccagacgg 8160 
gagagactga ggcttttctt ccgcacacac 8220 
aacggaagaa aaataattgt gtgtgctgga 82 8 0 
tgccacttgc attcgaaaaa gaaaaaaggg 8340 
ttgtctaaca tgagcacaaa aaactagaaa 8400 
catggctggt caatgggagg gaaacaatcc 8460 
tatagattct acagcacgag gacactttgt 8520 
agtggaagaa tctccagtgt tctgcttttg 8580 
cccatctaaa gggacaaatt cacagatggg 8640 
tttcctgtca gtgtgcaaga ttttcgaaac 870 0 
aatgacattt tgaaatgaat tttttttttt 8760 
gcagaaaaga tattcaacga tgctgtttta 8820 
atgaaaacag caaaagtcaa tggttcattt 8880 
gtcaaatgca cgcgtatgct acggtgctga 8940 
cgctgacgtg cacctctcaa aaaatttaac 9000 
ctgtgattga tcggcttggt agcgctgacg 9060 
ggtgcgagcc tgatagagcg attgtttaca 9120 
ggaaagtttt gttttgtgtt tacctcatag 9180 
cctctaaatg agcgagtttg agccacttgt 9240 
cagtgaagaa actcgacaaa gaggaacatt 9300 
gtgttaatgc agatcaacag agacagggcg 9360 
acatgcgccg tgggttacgc aggtcacttg 9420 
acatgtttat tgctccaaaa tattttaaat 9480 
tcgcctacta ctcagtaggt actgcatctg 9540 
cgttctatac agtagtaatg tggctagtat 9600 
ttttgtcatt atcacgtgac atacccgctt 9660 
atcatgggat agcgtagcgt ccatcggata 9720 
catccggata cttctcacat actgattttc 9780 
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gaattctatg aattcaaaca tactactcgg 
tgtgattttg tacggatcca ttgtgttcaa 
gacatttcaa aagatttttt tagagcagta 
tttttatcaa aggttaccat accgtctgaa 
aaatactaga agtaacaaag tggtatgtgt 
acaatctcag actggaaatc agaaaaaagt 
ttgccactag atagactaga ttaaatacaa 
tcatctcatt atgcatatat taattaagga 
ctgttgcata ttttttaatg ctttgaatat 
aagaaaaata aatacaggta ctagaaaacc 
ttttaatgtg tgctgtcatc attcaccccc 
agataatgta cttccccttc agccagacca 
ccaaccaagc aagcgtcacc tgctcgtgcc 
tcaaaacaac tgcagccttg cccagacaat 
cattaccatc tggtcacaca cgtgcaaatc 
tagtgaacca gaacacatgc aagacaaaaa 
ctggtagtga aacttctcct tcatctcgtc 
tgtgccctca tgcttgcgcc ccagctgaag 
ccagtggcag gtcctcttgt caggggactt 
aaacagcagc tggcactcca tgcactgaat 
gggcacaaac aggccttggc acttgccaag 
actctccttc agcaggcctt ggcctggcag 
acctgggcga agcagggcat tgcagagccg 
ggaaggtgca ttaaaaggca aaatgcccag 
acaccgcgaa cagtaaacgt aaagttcatc 
gtctcggagc actgaattga gtacctgggg 
aaagcaggag atggactctc cttcaaggag 
gggcactaag agtgggcctc ctccgaggac 
aggagaggcg cactcctgtg ccgattttgc 
ctggctgctc agcgtgaact gtgccagcgt 
cttggatgct ccgacaatgt ggaattctct 
ctcctcacgc tccttcttga cctttagggg 
caaatgcatc tctgccatta cccgtttttt 
cagcctcttc atgttgcctt taaatgaggt 
ctgagggctg gccatatcca ttcacagata 
cagacacgac acctttgggt tggtcctact 
tttacgtcta gctacatgca fcgatgctgac 
atgaaaaggt aaaaaggctt gtattaaaat 
aatgtgaagc cacgtgaatg cggatgaatg 
ccagaagcgt tttagcgttg ggcacagtgc 
atcaggataa tctaatttga aagcaatcga 
gctgtttttc tctactgaca acatgccgaa 
taataacgtc ggtttaacac acagataacc 
atttagtcct gtccaaccac agcaggcttg 
cacttctgcc acgcaagatt aaatgtattt 
atcgtacaag caaaagcctg aactggcgga 
aaagctcagc ctcaaattca gcgcatgcca 
cgcgtttaat ttcagaatca caacatgcgt 
tggcgaaaat taaagggcca ccaaagcata 
ggttttgctt taacgttacc agatttgcgg 
gacgtcgttt catttcctca taaaataacc 
tatttttaac cctgaatgac agcgtctctc 
gtccccgcct ctacccgtct ctcccctctc 
gcggtgtgac aatcaccatc cccgcaatcg 
gatcacggct attacacctt aaatatccca 
cattcccatg tcttcattcg gcagtgaaaa 
ctgaccaccc cctcaaaata cgttaaacag 
gaaatacaga catttgtatt aaacacaata 



ctcgcatact gtttttagca tatggaagta 9840 
agcgaaaaaa aaacgttgtt gctttactca 9900 
atcacaatac tgtgaaacca aaccatgata 9960 
ttttataccc acccatccct attgtagata 1002 0 
gcagaaaaac actgttcgtg atcatttagt 10080 
tatttaaatc catttccttt aaacttaaca 10140 
aaatatatat tattatacat tatattctta 10200 
ttttggatgg aaagggtaag tgtaaaactg 10260 
ttaaatacag atttttgaat tgcaatttat 10320 
ttgaacatcc aaggtaacaa aattccacca 10380 
taaagctgac cctgtactga cctgagagga 10440 
actggaattc cctccctccg gccatacgtc 10500 
accccataaa attgggtcag cactagttta 10560 
agcaaacagt ctcctctgcg gactatcaca 10620 
cacttgtgaa catgaagcaa aacttgacct 10680 
gacgcttact ttgtccagag gcctctttag 10740 
cagcagctgc ttgagctgtg gctcctcctg 10800 
gtagcagggc cacttggctg aatcgaagcc 10860 
gtgtgagtgc attacaaagt tctgtgggga 10920 
gcaaggggca tccggctgga cgtagaaatg 10980 
acactggtgc tccacctgga aactggcgtc 11040 
tttgctgttg ggatcggctg agatggtggc 11100 
ttgggcatca gttagggtaa tcaggccaca 11160 
caccttgaga atgtgcagct gctccgcatc 1122 0 
acaaacagca ttgatctgct gcagagagaa 11280 
cagacaaaga cgcttctcac cacctacaac 11340 
agaatgggta agctctgtgg agctgtcaca 11400 
tgggggagac ggtaggggag ccatgccagc 11460 
tgagaaagca gcaggacctc ccagggagct 11520 
gtgcttgagg ccggcgctca gatctaaagc 11580 
ttcctccatg tccatttctg agctggtgtg 11640 
tttggggatg ctctccaggc tgcgacgttt 11700 
aatgggagca tcttccagtc gctccctgta 11760 
caggtcttta aaaggggttt tcagacctgt 11820 
actggtcttc cttgggtgga aaataatcgt 11880 
ttgctcgctg tgttaaaggt tgcaaatcct 11940 
agaattaaat gttcaacgag cttcctgtga 12000 
ataattctta ctaaatcaaa cacaaatcag 12 060 
gaaaatgctg ccagagcaat taaaaggcat 12120 
taaactagac gacagaaaca ttttcacctc 12180 
tctaaggcca gtttctcgct ggtttagtgc 12240 
tccgtcaagg cggtaaatga gtcagtgcac 123 0 0 
aacagcatca gtaactaaca gcgagcagcc 12360 
aaacaaaatg tattaacgac acataaacac 1242 0 
attatattgt aaataaaaaa ttatgttttc 12480 
atttccgagc acgaatatga aggcaaaatg 12 540 
gtggggtatc ggcaaaatcc atcagaaaaa 12600 
ttctgttcta caaacggaat gcattcattt 12 660 
ataatctcaa gttcctcgaa ataaacgctt 1272 0 
gtcaactcaa acagactgac cctgaccgca 12 780 
agtttccaga aacactggca aaacgcggtc 12 84 0 
tttttacccc cgcaaggaca ccgcggctct 12 900 
cttctccggg cagagctgcg ctgagcaccc 12 960 
ggaaaaccct tcacatgaaa aaaaaaacct 13 02 0 
acacgacatc taaagcttct attttaaatc 13 080 
cgtttgtcaa ccattcagcc agtctgacac 13140 
cgatatcccc aaaactgttt ttgaaaaaca 132 00 
tagatcatat ctaagattta aatcaatcac 13260 
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acaaatactc ccagctcgat caccaatatg accaggttaa acacataata cacacgtata 13320 
ttatgccctt ttaaatggca catcaaaagc accaaaagca ccaaaaccgg tttaacatct 133 80 
gaaaaactgg ctcgttcgca cagacagctt tagcagtgcg gctagaatat ttagaccctt 13440 
tagcacgcgc taaccgagct aacgctacaa atccccgttc agagacagaa ataaacgtca 13500 
cttaactatt acaaaccttc aaaacggttt taaaaaacac agcgacaaac tgctgaacca 13560 
ccagagtaat ggacgcagtg tataggcgaa tcagatgtgg gttttaaagg gacttgcctc 13 620 
aggatgaacg atctgctctg ctgtctctct cggtctctcc tctagtctgt ctgagcgtca 13 680 
acaacacgcg cacacactca aagccttcag acagatcgcg cgtcacacac agcgagagcg 13740 
cgcactggcg gagcggatga tgttgcgcgg gcgtgtctat gtatgtgtgt ctgtctgtct 13800 
gtctgtcctg tctgtctgca gcgcgccgcc agattctcat gaactttgca agaatttaca 13 860 
ggttacagta aatgtaatgt aatgcatgta atacatgttc agttgtcatc tgtggtctac 13920 
tagtaacttg cgcttgcaca catgactaac caagggacat ttctttagaa ataagggagt 13980 
tttgctgtct acttagggac agtcatagtg gagcatgctg aatttgaatc actttatggt 14040 
aagagaaagg agtaaatgga aattaaatag taactttagc cacatcccca ttcacgctat 14100 
gacttagcat gatttttgag gaataatgaa tggcagcatg agctgaggtc tgtcaatgtt 14160 
tcacagataa tgactcatct gaaatgggct gtttaaatga aatacatatc atgcaaaaca 1422 0 
tcacagtttt tgtacttggt aaagacataa aacattattc tgttttacag aattatagtg 142 80 
ttgagggcgt aacacattac aagtaacgag ttacgtaata atattacttt tttaagtaat 14340 
gagtaatgca tatttttaaa aattaagtaa tactatttga gtgacttttt agcttaatta 14400 
attagcttat aaaaacaaat ggctgaaata aaatcaatgt cagattgaat cccacactca 14460 
atgatagaat gcaggaaaag acagaaacca acatggcaga gccttatatt tctgcaatgg 14520 
gagttttctt attattctga acgaatcaca gaaaaggaaa acaggattat agtcttaatg 14580 
gatagtgatt ttactttatt aatagcacaa aaaagacaaa agtagtagtc acattgcatt 14640 
agactttcat tattacctgt ataggcatat atccttgagg gtcaggaagt tctcaaaaga 14700 
taagattatc ctacattatt attttatgat gtgtttttaa aaggtaaatg aaggtgtagg 14760 
ttatttagtg cgatgtttat tgcagagctg ctttatttta aaaaagaagg aaaaaatcct 14820 
gcaagatctg aaacagatca agtctcagcc aggtcagaaa aggtaatgca aaagtaactc 148 8 0 
aaaagtagcg tgacacatta cttattataa aaagtaactt agtaatgtaa cttacttttt 14940 
tgaggagtaa ctaaaaattg taatgcatta ctttgaaaaa taactttccc caataacaat 150 00 
ttacagttga agtcagtatt ctagccctcc tttgaacttt gttttctttt taaaatattt 15060 
tacaaattat gtttaacaga tgtctggtaa tatttttttt tgttctggag aaagtcttac 15120 
ctgttatatt taggctatta attcaggagg gcaaataatt cttacttcaa ctgtagttca 15180 
aaagttttag gaaatgggtt tcagaaaatc atgctttttt tcagtaaata ttcagctttg 15240 
aatcactggc ataaatagca ttttaaacct attcttaatt aaaatgattfc tcatttatat 15300 
tatttctaaa ccaaaaaaaa atgtataaaa tatgacaaaa attgcgaaaa atgacttaca 15360 
gattttattg tatttattta caaaaaaata cccatttagc tttgtttatt ttatttctat 15420 
ttcagtttta gctttaattt ctgtttactt tcaatgtgtt ttgttaagtt gaaaaattat 15480 
tagttttttt tttttacatt tttaaaatat gtttttctat atttccattt tacttgtaat 15540 
ttttataata cattgtgtat tgtgctgtca ttgtttctta aactatttat tcattcattc 15600 
attttctttt tggtttagtc cctttattaa tctggggttg ccagagcgga atgaatcgcc 15660 
aatttatcca gcacatgttt taagcagcgg atggccttcc agctgtaacc catctctggg 1572 0 
aaacatgcat acagtcattc acactcatac actaggccta tttagcctgc ccaattcaca 15780 
tgtaccgcat gtctttggac tgtgggggaa accctaacgc agggagaaca tgcaaactcc 15840 
acacagaaac gccaactgac tcagccgagg ctcaaaccag caaccttctt gctgtcaggt 15900 
gacagtgcta cctactgcgc cactgtgtcg ccttaaacta ttttttattg ttttattgtt 15960 
atttttggat ttgaaaattt gtgaaaatta aatcacatta aacaactaaa ctgaacttca 16020 
tctcagaaaa ctgggctgtc attttcaatt tactagaact tctatgttaa gcaactttaa 16080 
tagaatctac attgtaaaag cgctataaaa ataaagattg tagttttaat aatttatcgt 16140 
gtactgtact gtgctgtcat tgtttttgta tactatgtta gtttgaattt ttttatttca 16200 
aatttggttt taattcagtt ttagttttac acctccattt gggaaattaa ttccaagttt 16260 
taattatttc acctataagg atacattttc agactcatta agttttgata aaatatttta 16320 
tttcattttg cattttattt cttgttcaca tttttaatta ttgttgtcct tttttatttt 16380 
aatcgaaagt tgtagtaagt acaaattccc catgtccact tcattcattc atttatttat 16440 
tttccttggg cttagtccct ttattcatca ggggtcgcca cagtggaatg aaccgctaac 16500 
ttatcaagca tatgtttcac gcatgaactt ccagctgcaa cccagttctg ggaaacaccc 16560 
atacactctc acatacacta aagccaattt agcttattca attcactttt agcgcatgtc 16620 
cttggactgt gagggaaaac agagcacttg gaggaaaccc gtgcgaacac agggagaact 16680 
tgcaaactcc acacagaaat gccaactgac ccagccagga ctcaaattag tgacctactt 16740 
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gcggtgaggc gatggtgtaa tcaactccac tcaaatgaaa tatattgtga agtaaagtct 16800 
acgcatcaat ccacagaaga tttttcatct cactctgtgg ccataacagc ttaaaaagaa 16860 
ctttatcgga tttgatgtcc gaattaagct cattttaaac cacccagaca tatttatgga 1692 0 
cattagtgtg tgagtaactg ttacgctagt acaaatgttg tggaggcttc agccaaggac 16980 
attaagtaat aacactagaa cattctctat ggctgagtaa tggcctggag ccaatccagc 17040 
atttcgctcg gtccacaggg cttcaaccat ccaatggcct tcatgttttt gtcaatggac 17100 
ccatgatggg gtgctgggat cacctttgct ttggtttcca tctctgtgtg agccatttgt 17160 
taggtgtcat ttaaatgtta aatgggagag gcagagcatt cagcagctcc ttcacaagcc 17220 
cagagcagca cattaacact tttaaaagct cctcctccac actgacccag gtcatgttca 17280 
ggcaagacag ctttcgggag attccaacag ttttgatgtt tgtgctccta ggtgttgtgt 17340 
tttttcttta gacttaaagc tcacacaaaa gagcaagggc ttatgttgca aaatcttaca 17400 
tttcacttaa aaattgtctt gttgttgcgg aacggcaaac ctttattgga ataatagagg 17460 
agaaaataca tagtgtgtgt gtttaataaa gtgctttcaa fctaattaaag aaaaaaacta 1752 0 
atttagagat ttgtataatt aggtttaaat attaaaatcc aaaaattttt aaattatatg 17580 
taaataatta caaataagca aaaaaatgct caaaatagag tgtatgggtt aataggttgt 17640 
ttaaatgaat aatatacatg cacgtatata catttttatt agtgtacttg tatactataa 17700 
caacaaaata ttacactacc tgacaaaagt cttgtcttct atcccagttg taagaacaac 17760 
aaataattac ttgtcttgta gttgatcatt tggaaaagtg gtagaaggta gatgcagaaa 17820 
gtagattttt cagatgaatc atctgttgaa ctgcatcacg atcatcacaa atactgcaga 17880 
agacctattg gaacccgcat ggacccaaga ttcttacaga aatcagtcaa gtttggtgag 17940 
agaaaaatca tggtttggag ttaaattcag tttggaggtg tggaaaagat ctgcagagtg 18000 
gatggcaaca ttaacagcct gaggtatcta ccctgaggta gcctgaggta ttacaaacca 18060 
caagagaggg taaattcttc agcaggatag aactcctcat acttcagctt ccacatcaaa 18120 
gttcctgaaa gcaaagaagg ttaaggtgct ccaggattgg ccagcacagt caccagacat 18180 
gaacatattg agcatggggt aagatgaaag aaaaggcact gaagatgaat ccgaagaatc 18240 
ttgatgaact ccgggagtcc tgcatgaacg ctttcttggc cattccaaat gactttatta 183 00 
ataagtgatt tgagtcactg cagagatgga tgcagtcctc caagctcatg ggagtcatat 18360 
acaatattca ttcttttacc accgcaccat gacattatat tctatcctga acattatttc 18420 
tgttcagacc ttactgtctt aattaaatca ttaaaaatca aagcatgatc atattttatt 18480 
ttggtcaaat aagcagaaat tttaacacct ttgactttca tataaatcac ttctgtaacc 18540 
aaatgatcaa ctagtagtaa agttattatt tgttgttcct aaaacttgga taggcgacaa 18600 
tacatttgtc aggtagtgta aatatatata tatatatata tatatatata tatatatata 18660 
tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1872 0 
tatatatata tatatatata tatatatata tatnnatgta tntanimnnn imimiinxinrm 1878 0 
imnixnniuiiin rmnnnnnnnn nirnruinnixnii. nimnnnnnnn rmnnnnnriiin nnnnrmnnnn 18840 
gtctaatctc cttggtgtta atatctatcc atccacccat ccatccatct catccatccg 18900 
tcacttgctc aaaatggata actcattcct caaaagcaag tattgatgtc agtcaaagtg 18960 
tcagtgtaag atgaaaagtc ctgacagcat tgtttatgaa caagatagtc aaatggcttt 1902 0 
gtcatgtttt catggtttca cagtttacac taagtttttt tccaatgcaa taaagtcaga 19080 
ttttggtaac acttgctgaa aatgctcaag acaggactat atactattgg cacagccatt 19140 
tgaaaactac agtaaagtta agacatccct gcatttaatg agggatttga gtacaagaca 192 00 
ctgaatatgt aagtttcaca ttttactgca tttgcatttt gcaatactaa tttagtcaca 19260 
gcattgtgca gaagaataaa cacaccccta tttacaacaa acataaattt cctttgggta 19320 
gagctgtgca caactggaaa caatattgca gtacatatta caactgaacc tacaacatac 19380 
atggtgtagg tctgtaaatc attcatcaaa ttgttcaatt tagaagacat tttcaggatt 19440 
tttttttttt ttttttactt tttaaaaaaa gatttaattt acatttgaca gattttgaca 19500 
tctagttcaa aatttttgca tgtaatgact ccagtaatga aatgaggact attagtttta 19560 
tataaaatga ctattcagca tcaattaatt ttgactgaca tgacattagc aaattaaaat 19620 
aatataaaac agcagagagt tgtatgaaag caattaatgc atgtcaaaaa gcatctgcag 19680 
tttattggaa ggaataagaa actgctatta tgatgtgcac aaatgaacta attgctttga 19740 
gaaatgcatg aactaactgt tgtgcaaatg taaatagtgt tgggagaaat gcactaaagc 19800 
gactgagaaa aactgtaatc taaaataatt taaggtaatg caaaaactat tttaaaatac 19860 
ttcatattat attataatgc tctaaattaa attaactaaa acaaagcatc taagaaaata 19920 
tactttgtat ttgaaaaaaa agtcaaaatg aataaatgta tatatgtata tatatatata 19980 
tatatatata tatgtgtgtg tgtgtgtgtg niumnnnnnn nnnnnnnnixn niumnnnriiin 20040 
imnnnimniiii niumnimnnn imnnnnnnnn nnimnnrinrm imnirnnnimn nnrmimnnnn 20100 
gacggacgga cggacggacg gacggacgga cggacggacg gacggacgga cggacagaca 20160 
gatagataga cagatagata gacagacaga cagacagata gacagacaga tagatagata 20220 
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gatagacaga cagacagaca gacagacaga cagatagata gacagagacg gacagacaga 20280 
cagacagaca gacagataga tagatagata gatagataga tagacagaga cggacagaca 20340 
gacagacaga cagacagaca gacagacaga cagatagcat agatagatag atagatagat 20400 
agatagatag atagatagat agatagatag atggatagat ggatagatgg atatatggat 20460 
atatggatag acagatggtc agacggaggg acggaaaaac taatagacag acagaaatgt 20520 
agatagataa atagatagac agacagacag accaaggata ttagagatag atcaaagaga 20580 
ttaaagatag acagataata taatataata taatataata taatataata taatataata 20640 
tgaacaagat agttgtcaac aagctactaa ctatgatttg aacatgatgt gtatcaaatc 20700 
aaatgtcctg agctcaatgt cagcaggctg tgtgtggttt gatgctccag tgatcaaggc 20760 
ccatttcagt ggcgttttca gctctggctg cagatagcaa ggtcagcaga tgtattataa 20820 
agcaggtgcg cccacacaca cttgcacaca aacacacacg cacacacaga actccaaaca 20880 
ctgccacatt gatttttttt ccagagggta ataacatcat cccggtattg tgaggagtaa 20940 
tttaaactta atatgggtaa atactctcca atagctgata gttcatttaa agcttgacat 21000 
aatgctgttt gtgtgcagga gaatgaagga taaagaatgg cagggaaatg gattaagcgc 21060 
tgatgggcca cacagttagt caggcagctg tcttcattac agactatggg actaaaccct 2112 0 
aagcaccccc taaggacgga gcttcagagg aaatctatag ctaaatgcat catcagagag 21180 
acaaacaaac gcttattgcc aacactactg ctacatgtgc ctgcgacctg ctccgaaaag 2124 0 
gagattctgc ttgttgttta aacagatgaa gtaaaggaaa cgcacctgga tccctcctcc 213 0 0 
ctaaagagcc attcaaaacg gcagatgtgt ttggtgtaaa tagaaggatt tttttttagg 21360 
atgtgtgggt gcttccaaca acatcaacaa aaaaagtgga atcatacata caagatgcta 21420 
tgcaaatcat actaatagaa gttaaaaatg taaacatttt gataaaaaac aatttttcaa 21480 
atgtgtttta aaaaaaaaaa agttatggcc aaatttagaa ttcattcacc ccagagtgga 21540 
tgaaaatatg cccaacagca cataagggtt aacgtaagtt gcaaacaatg tatataggct 21600 
gaatgtaaat taaatgtaga cattaaattg gaaattaaat gttgtaatgt tcaactaaat 21660 
ttctttgttt aaattcagcc ccagctgatc tcacgggaaa acagaagtat tttacgtttt 21720 
gccagttaat tcagtcgtac gaaattgtac gattttaaaa aggaggctcg gcacctaacc 21780 
ccacccctaa acgcaaccgt cactggggga tgaggaaatc gtactaaatt gtacaaatta 21840 
gatcgtacga attcttacga attatccact aaatcaaaaa gttacaaatt gccgtgagat 21900 
tgtgttgatc agcccataga aattgtttgc aaccacttac cttaaaaatt tagtaaatcc 21960 
aatgattcat tttttagtgc atgtaacata cttggttttc aagattgttc tctgttaatt 22 02 0 
tgcaagaaat gctctttaca caaatagaac ttccccatac ccctcaggta agatggacct 22080 
ttttaggttc tcagtcacaa aaaacctccc agaaagattt tatgaaggtt cctttttgtt 22140 
caccagaaaa gtatacccaa agaacgctct caaaatgttc caatttggtt ccctaaaaat 22200 
aagaaaacag gtaccaaatg gtaccttctg ggaaatgttc tgttgcctgg gttcaaagga 222 60 
aatcccacag ccactctttt ttctcacagc gtatcatcgt tctctaccac taatcaaaca 22320 
aggtcacaca tctccagttt ggagggcagc tcggctctgt cttccgcagg gaatcgtgtg 22380 
ccctgttctc cagagaaaga ggacacggag acaatgtcac agcagaaaac agccacaatc 22440 
atcgctcctc cagggcttta cagtaacggc caaagtcaca caaggtgagt cgtcaaaggc 22500 
agacgcacga ggtgaagccc tcaaccatct ggacaacaag atgcagctcc acacattgcc 22560 
atgttctaca tatgtttatt taaatcacag gaagtggctt tcacttttgt aaacttcgaa 22620 
gtctgaatat gcatgcacag aggccactgc ttcatattac tgtaacggat ttctgcatca 22680 
tgtttacaga caactcaagt tcataggaaa ttacagcaaa gaaacagtct gattgacgtc 22740 
tgaacttttt tgagcttcgt ttgtttagtc tacaaatgcg cattatatga cttgtgagag 22800 
tggttctgat ttcgctatat gaaacaagat atactgtatg cataatgaac taaactccaa 22 860 
aaataacttc aaaccaagtt caaactactt atttgaaatt attagagttg aaaacacaat 22 92 0 
tcttacattg tttcttaaaa caactttgtt caaccaaaga ctatataata tacagtcaag 22980 
acgtgtcact cgtaaagttt tgaacgggta aaattgtgac ggtcaatatg acgaatgaag 23 040 
cactgccttt tagtacaaga gccaatcatt caattcaatt tatctttatt tctatagcgc 23100 
ttttacagtg aagattggta aaaagcagca taatatagaa gttctagcaa actgaaaatg 23160 
ttagtccagt tttcagtttg gttcggttca gtttggttta attttcactt tcaaaagtcc 23220 
aaaatactga agggcaaatc catcgatgtg cagctccact acttacaaac caagcaagcc 23280 
agtggcgaga aacaaacttc accaattgac aaaagtgtag agagaaaaaa aaactcaaga 23340 
gaaaccaggc tcagttgggc aagtacagtt ttctggccaa acttcatgtg cattactgca 23400 
gtctaggcac cagagggtgg agaatgcagg atgtggagaa agaagagacg tggagaagct 23460 
gcaggtgcga cggctgttta ggctggccac aagattgatg cacagactcg tctgtcactg 23520 
gagtttttca ggactcagtc ctatgctctc cacttcttta tgactgctcc agaatctgct 23580 
caggatatgg cctggtcaag gattatggag acctctagaa gtcctctatg gttggaatca 23 640 
tctcctctga ggtctcaaaa gatctcaaga aaagttatca ttgatcgcta tagatcgatg 23700 
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attctccagg aaaggggctt ggtttcaaca gatcttgtgt gattcgaaca tttataaata 23760 
aactaaatgc agcgatgccc aaactttttc ttatgaaggg ccataaacct tgattgggca 23820 
gaaggcaaat atagtttcca tgtgtaattt ccttatttat ttaaaatgat ttaaaaatgt 23880 
ctagactaca ttgctttata tgaacttata cagtattttt aacattttac aacgaactta 23940 
ttacactaaa tatctaattt ttgccttgat ttgctcgtcg atgtcttctg catagtccgt 24000 
cgagatcgta cgaaaggtcc gtcatttaaa ttttttaaaa tctgattcat ttacaacatt 24060 
taatttaagg atttaatttc agtttgcttt ttttgtagct cagcaacaaa acaaacaaat 24120 
taaaactgca agcagcaata aaagggacct tgcactatca aaaaaggtta cgtcaaatta 24180 
gaaataatga tctcggtgtt aaaagcattt gccccaaccc tctccatcat tctcactctc 24240 
atctcagttg gggtggtgaa ccaaatcaaa ggttacaacg ggccaacttt ggcctgcagg 24300 
cccaactttg agcatctctg atctaaaaaa acagtcgttg tttaatatta ttggagattc 24360 
ctaatatgaa atgtaagctt ggcatgcaat tttagagaat ttgttgtttc cccatttaaa 24420 
caaaatgccc gagaggcgtt tcaaagacgg ccgctgagtg aaatgactag ccttaaaggg 24480 
actctgcttc aatccacttc agtttgtaaa aaaaatttta tcgattcttt ttgagacaac 24540 
atgaagaaat tgtgcacaac ccagcatttt tacagtgtag gtcaaaataa gacaaacctt 24600 
cagcatcttt catcgctgcg atgcagttag ctagtctgac actggaggta aaaatgcaag 24660 
atattttttc tttcgtatta catacagatg tttcaccgga fcaaaatctgc ataacgttgc 24720 
caagagcaat tattagtttt atgcagtcct tctgagctga attaattaat tacaagtaac 24780 
aagtcaggtg tgaactgtcc aatcacatca aaacaaactg ctttcacgct tgaaagagaa 24840 
tgagattttc ttaaatactt taatgaggtg ttaacaaaat aagtggtaca gtttattatt 24900 
ccagtcagtt gatagtgttt ttcagtgtgc atattaatac acaaaacata cagagagaag 24960 
aaaaaaacag gtccaattca gccccccgat gcttctttaa agcatattag cacatcattt 25020 
tcttttttct gggaatgcat cgagggcctt aagcgattca atcatttcta acgccgggtt 25080 
gctaacgaga tcctcgcagc tccaaaccgg tgtgaacatc tgtgaaatag taagcgagtt 25140 
gaagcgtgca tggagtggaa tgagagaggg tgatgctgca tacgagtagc ctgtttaatg 25200 
tcatcatttg gtcattatgg tcaaaggtcc tttcagttta gtccacaaac aaaaaacacg 25260 
tccagacatg atgtttaaaa aaaacgcatc accccaaaaa ccgggcgccg tgtggagtaa 25320 
acacaaacct catcatgcat ctgatcaggc ttgaaatccc caaaagggct gaaagaagcc 25380 
actttacggt gactgtatcg aaaaacaatg ctcaccgatg ctcagtttgt tcaatgcaaa 25440 
aatacatcaa cggaatatca taaccagaaa ggcaatatat aacatttaac acaagaaaaa 25500 
gagcaagaaa tcaaacaaga aagcgagaaa tagtttatac atgctgcagt tagagtagta 25560 
acatgcgttt acctcgtctg tttggagatc gccaagagga acactcattt tagtgtctta 25620 
ttttggtcct gcattgaaga aagctgttgt gttttcacct ctctatacag aaaatgtctc 25680 
ctccattcac agttgtaagt atctacaaca gtgttcaata atgttccctg ctccaatcaa 25740 
tcttcaatct aaaacgagga aacaagtcca accgtcttaa aacaacttct ggaaaactct 25800 
gattcaacac attgtgcaca ttctatatag atgacagtac tcttaaataa aaatatagta 25860 
ataaaacaaa gtgttctcat ccttacgcta cagaaaacag ataatcagaa actttcttca 25920 
agaatatccc atgtggtata gagttttaaa atggcttact atatatgctc aaaggattct 25980 
ctttcatatt tacaaaataa aatgcatatc tttagcattg ggaggtcctg cgtaccatca 26040 
cagtcattct tttttccttt agtttgcaga gtccggaaag ctaaaactgt tggtcgaagg 26100 
cactgacaaa tgtcctttca tcagtgctgc tgattaacat gatacaaaaa taattgagaa 2 6160 
aaaaatagac acttaaggac tagtggttac aaatggcaga aagtgaaaga catttggcaa 26220 
agttactgtt tcatctggtc ccaacaaagc aaagtcaggt tcttgttgct tcctttcgtg 26280 
tgattgattg cgaccatgca aataaaggca gcgatgatat gcgagtaaca gagggataaa 26340 
gcgaccgttc acacacactc ctccgcagac atcagcagag ggttgatgta ctcgaagcct 26400 
tcaaactcag actggtcgat cttctttaca gcatcactgg gggaaaggca aacacataat 26460 
caatcagtct taacccttta ataggcatcg taactgtctg cttgaccttt ggtccactga 26520 
ctttcattca aataaaaacc tcaaaatcag cttagagtgg tggtaggaac aaaatttcaa 26580 
taaaaaattc ttagtgtgaa atttaaagag gccaaacaat gttgatatct gaaagcaatt 26640 
ttttgatttc tccactaaag ttcctataga cacatctctt ggtgctgatt ggctgcaagt 26700 
ctgttttggg acttggtcat tttgcatgta aagtcgaaat actctcactg acgcctctgc 26760 
atgagttaat tttttatgtg accattactt agaatgcatc acttcaactt ttccatggca 26820 
atgccacaac accagattta tgtatatccc attttatgta tattctgcaa atattgacaa 26880 
acatcctaat tttattgaga aatatctgat atgccaatac taaaatatgt gtaaggaaat 26940 
ggactaattt aaatgtctaa gacataaaat gaatgtaagt tgtctgaaaa caccaacaat 27000 
attagattta tgacattatt atccagcagc tgatcatgtg acgcactgac tattaatcac 27060 
accagtgtga tcgtgcgcat caaagagtta accactattc tcaaaaagag acaggctata 27120 
aaatatcttc cagaatgttt atgtgtaagc acacagactt tattattcaa aaaccttaat 27180 
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aaagatcata ttgatggggg atcagagctg ctggatttaa aacatggtaa ttcatcatct 27240 
tcacccaaat tacatgaaag taacttactg tttagctgag aaatgaacag agtgaaggtt 27300 
attgcacgat gaatccaaaa ttttcgcctg taatttttgc acattaaaaa ataaattcaa 27360 
cctcacgttg tgtctatcct gttgacagac tgcctctgaa actttcgtcc gttataaaaa 2742 0 
aaaaaattta atcgggtttg attttttaat ttttcacgtc caaaaagacg ttttgagagg 27480 
cgttttgaca gttcagagcc accatacaag caaaaggcaa aacacaaaat ccccttactt 27540 
gagccacaga taactttatg acaaacacag tgcatgatat aagacaaaac atgtcaaaga 2 7500 
gttgaaaacc catatgtctg aatcagtgac tagttcactt ctaacctgct gctgtttggg 27660 
ctgtgtgtgt ctctgtgtat gtgtccatac atttgacgta ctgtccatag acattataat 27720 
agaaagacac ctcatgtttt ttttcggtcc gttgctccaa tgtcaatgtg gccgccattt 27780 
attctaatgt ctatggtact gcaagtctct gttcagtcta tgtgaagcat cacaagcaat 27840 
gtatcaaaat tccactgatt ttctaaaata agtgttgcat ttacattcct ctttcttttt 2790 0 
ctacttcaga aaagagagga gaataaagta tcgctgcttg aattgactcc gaaatgtttt 27960 
gcattttttt actgaatgga ttaattaaca cgcatcggac tcagtatgca agcattgtat 28 020 
ttgttcgtct tacacacgag acgttaaacc gaggtcctga ctctttgtgg ttaaaaatct 28080 
catgacattt cttgtaaaga gcaggggtgt aaccccggtg tcctggtcaa attttcttca 2814 0 
tcggccctta cccatcatgt actcccaatc atccccatcc aatgaattgg ctttatcact 282 0 0 
atctctccac tccaccaata gcttgtgtac aataaataaa tgcatacgta aattttggac 28260 
ttcagtgtgc aaagacctta aacgttttag taacattaac agtgtgacac acatcgtcag 2 832 0 
ataatgtgta acatgattat catagactat cataaacatt agcgtgacgt tttacaaacc 28380 
aataaatgtt atgttttgct agcacttggg taaattaagg ctgaacatta tatcgtttaa 2844 0 
gcatcgatat cgcaatgtgt gtatctgcaa tagtcacata gcaggattag attattttta 28500 
agattaaact ataaggatat tattacattt tattgttatt aaaactgttt atacaataat 28560 
aagcatgttg ttttacattt gactgttcga tttctgtaca tgaatactct tagacttaac 28620 
aaaaaatcat aaagtatcgt ttatccattt gcttttgctt gtaattttta ataatttatg 28680 
cagatccact gcattaaatc atcccagtca atctacattt ccaaatagag ctaataaatt 2 8740 
catctggtaa aattatattc atatcacaat atatattgca gtaaaataaa atatcacaat 2 8800 
gtcagatttt tccagtatca tgcagcccta atttaaatga ctgaaaccta aaattaatgt 2 8 860 
catttttctg aaaacaccaa caatgttaca tttatgatat tattattcag cagctgatta 28920 
tgtgacacat tgatcatttt aatcacacca gtgtgatcgt gcgcatcaaa gagttagcca 28980 
ctattttcaa acagagacag gctgtaaaat atcttccagg aaaactgggt tatgtggaaa 29040 
tgagtgaatc gagtctgtaa tgtgtcatca tacatcgtaa aaagcaatat tacttcctga 29100 
tgaaagctgc attgctgtgg tgaagatgtg aagtagatca atggcgtctg agcctgtgat 29160 
tcagaaagca agaggcgtga cagtgacaga tgtacatgaa cagacactca ctcatcgtca 29220 
ggcgtgagct gaatgggctc gttggtgaac tgggcatcaa agttatccag accaaactcg 29280 
cccgagatgt taggtttgaa cggtggaact acttgcttct gctccatctg aaatacaggt 29340 
aacatgagtc aaataatgac tgaaatatac tcatcataat gaggttacta cttcttaaag 29400 
ttaattaacc attaactata aagaacagat gtttcagtta aggaagcaag aattctacat 29460 
gtacatgagc caaaaatgac gtatatttta accatgattt actaaagaca atcatgataa 29520 
gatcattcag tgtagcaata gtttatacac ctaaaaaatc ctgtcatatt aagaacaaga 29580 
gtcactggaa gacatattaa aaggttcagc ccaaaaatac tactttagat acctcatcct 29640 
ttgtaattat ccttccagcc actatatttt acccttttgt tagtgggttt ttaattcaat 29700 
acaaatgtac tacatttaaa tattctttta tatatgtaac aaatatatgt tcacttattc 2976 0 
ctttatgtaa tacatgagaa agaacaacat taagcatttt cattttgtct gtttatgcta 2982 0 
aaataacaca caaatttgtc aattgtgctt gacataaatc tttttgaaga ataaaaatga 29880 
gtgataataa ttctctgata ttttttattt tgttactttg agcaaaaact taatactgag 29940 
ttttttaggg ctgctcttcc cagctcttac tctgtccttg ctgtaaggtc aggattagct 3 0 000 
ccccctgcag gccaccacaa acaccgcagt ctgtctgaga gcaaagtgta atctgatcgt 30 060 
tcaaactgtc tgaccgctac agctgtctgt gtgtgtgttg cccagacaaa acagggacaa 3 012 0 
gaggtcactg tttgtctgag tctcatctgc tggtttgaca tcatttaaac ctgcctcaca 30180 
tgatacagtt cacagtctca accctaagag cagaaaaatc aataacatga caccttatca 30240 
ccttcattgt atcactgccc ttacacgtga tcatgataca accacaataa aataaaacac 3 0300 
tcagacttta atacattcta cagattcaga tatagtatac caatatgtat tagtacggaa 303 60 
actcgacaga aatcaaaaaa ataaattata aaccgcttga ggatttgctg aatttaatat 3042 0 
ccattggtta aagtaaaaca ctaatattgg tttaattcta aaaaggtctg aaaaaaagtc 30480 
tcaatttcaa cttattttgt ctctataata aacactttca tttgttgtga ttgaaatgca 30540 
gggttgttat cccatacatt attgatctct tgactttttt acagttaaaa cattgtcatt 30600 
gtgttgtaaa ataactaaat ctaaaaacat gcaatttttg ttatttaatg ctaaaaaagt 30660 
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attttaaatt tgataaaaat ctgtgcaaat taaacaaaaa atgtaatctg gaggatttta 30720 
aacgctaatg ctgtctaaac taatgctcaa gacacaaaaa acctcaaaaa agtactccag 3 0780 
ggtgtcttaa aacctttaat tttgacgcca caaataagtt cttgactttt aaatctaaca 30840 
tatgtaatct taaaatacac ccctcacttt tctgtttgtg taaaaaaaaa atatacacaa 3 0900 
aactaatagg atgtgtttat aatgactcat ttttgggcga gctttacttt caaagctgac 3 0960 
gctgctgaaa tgctacttga agtggtaaga gctcacgacc aaattacagg aatgactaat 31020 
caaatgaagg attcctcaag gtcataaaca acagtaaaca gctcttcagg ctccagtaag 31080 
tcctctgttt agtttcacta acgtcaggct agagagaatg actcattgct tcctctgcgt 31140 
ctcaagtcaa aacatccaag aagacagata aaggaaacgg agataacaat gactcacaag 31200 
atcccagtct acatttcgga aaaaaggatg ggccatgatg tctgcgaagc ctgtctgagg 31260 
atgacatccc agccgttcct tcgactccta caaaaaaaag gactaaatgt cagtgagaag 31320 
aaataagata tattttagtt cttttttgct ctaattgtcc agctaattgt ttgctaaaag 31380 
tctaatttga tattcatctt gggcctcatg tatcaacgct gcgtacgcac aaaaactttg 31440 
cgtacgccag gattcacgct cagaatcgct cacgtttgga tttactaaca atgaactgaa 31500 
cgtgggaatg tgcgcaggtt cacggcagct ttctggctgg cgtacgcaca ttttttgtgc 31560 
gtgtctgttt tatttccatt ggcgactcct agaggcagtt gtgttaaatt cctctctaca 3162 0 
aagtgtctga gccttgcaat ggcagctgta tgagacgggt tcatctagta ggtatacaag 31680 
gtttccatac catacagttg accagctaaa cattaaagca caatttgcag cggtcgcctg 31740 
ttttcccaat gtaatctgag cgatctaccg cacgcacatt gctataaaga cactafcctga 31800 
agatgaattt gcatgagtga atcagaaaca tttccattca attaatgtgc aaataaaata 31860 
tgatgcacaa acttattgat gattcctact tgtctttctc gtgataaata gtgggcaaaa 31920 
tctgatatgt agcggggaaa aaagaagaaa gagttcatca gacgctggat tcgagccgag 31980 
tttatgctcg aacatgtcag tacatgatca catgcgtctt acgaggtgcg ccactgagac 32 04 0 
tgttaagggt actacaacat tttacagata taaaccacac tatttatttt ttaaatgcac 32100 
tcagtgcgat gttcagaccc aactgtgtta accgcatcag ctaaactctc ccactctatt 32160 
ttttttcttt tgttgttaat tccggagaac aaacttgcaa ataacaccgc ttttctccgg 32220 
tctacctccg aaagcagcac ctccatttca cattctgttc aaagtttctc tttttgcttg 32280 
cttttgccat tgcttttttg ttgggttttt gcattagcat agtcattagc atattcatac 32340 
gggggaagag gcagggaggg gttttgtgct cgtgcatgtt gcgctcagtt tcacgttcat 32400 
tcggatgtac aaaagaatat gcgtgagatt cggcgtacgc agtgtttcat acatctgaat 32460 
ttttctgcgt acgcacattt acagctttgt gcatacgcaa tgttttagta agatttccac 32520 
gcaagtcttc gtacatgagg cccctggtct ttgtatgaat aaaacagaaa cagtattatg 32580 
tacaatatta gactaaaaag gaggatcttc atattgctgc ttttctggta atttaacttg 32640 
cttctattga cctcatttgt gagttgcttt gaaaaaaaca tctgctaaat caataagtgt 32700 
aaatgcaaag agatgaacag acacacagaa tgattaacac acaacagaag gaaataaagt 32760 
aaagaatgat tatggcagca ctataacaag ctcatctgac gtaaggacat tagcattgat 32820 
ctgtctggtc tgcttcacct tgttgaggaa tcccttcagc acgcttgcgg ctttgaccga 32880 
taacgatctg ggaattctga tctgcttctc caaaatgact gcagaagaag tgtaaaggaa 32940 
aacatgagac acgcacagaa agacaaagca tttttgaaac gttcatgctg ccagtcatgt 33 000 
accttggaaa agataatcct ctgtgttttg gtcagggtta tcagagctgc cgactatgtc 33060 
gaagggagat cttccagcca tcatctcaaa catcaggacg cccagagccc accagtccac 33120 
actaaaacct ggacacaaat aacatgcatc attctaaggc ttcatactgg tataaatgta 33180 
atcgtcagtg ggggttatga agcaatcgct aactataata aaggtttcaa aggtgcagta 33240 
tgtaggattg attgaactag gtattgcagt ccaaattcaa aatattgaag tttgtttttt 33300 
ctcacttagc ccttccgctg acacaaaggt tgccagattg attacaacaa caggagaaag 33360 
agtgccttca gtttagcctt tcactgtaaa cggatcagct aatgtttatg tttgtagttt 33420 
tttgctaaat aaaaactagc tcatgtagat atacaggtca tattttggca gctcagagag 33480 
ccaaaatata atgagtaaac ttgcagtggg gggagttaca gagaccaaaa cattgacaca 33540 
aaactcccat tttcaaaagg agaataactg actctagcat catttttcag ataaacaagt 33600 
atgttcatta agtattcatc ttaaatgtct gcatatataa catatggtat ttttaagact 33660 
cagaagagcc ataaacttac atgcagcacc ttcatgcata agcaaatttg tgattacagc 33720 
gttagcaaac atagttgcta actgaaaatc ttcataaaga gtctctgctg tgtgtgaatt 33780 
gagtttacca tagtcttctc ctctcagaat ctctggtgca atgtaattgg gagttccaca 33 840 
gaaagtgctg gttgtatctc ctggtctcag tccctcctgt gtaacagata acaataatga 33900 
aaccaaaaaa acttataaca cacattagct atgtttccat gcacctattt ttatgcacat 33 960 
tttggataca agcaaaaaaa aattttgatt aatggaaatg tcaagaagcg catcaatttt 34020 
aaaacgagta ggacaaactt tttattttat aataaaacta aactaggatg gaaacacttt 34080 
taccgaacaa attccaggtt gcgcattaaa actggtcatg tgattttgtt atgagatcat 34140 
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gtaatgaaaa aaatatgtgc gaatggaaaa accagcaggc tgagctcact ataaaacatc 34200 
tgaaatgttg ttttagtcat tctaaaacgc cttaaccgtt tcagcattag tgtcattata 34260 
tcattaatct tcgcgtttca cgcctttaaa cgccaccaca tgttcactgc gagtcagttt 34320 
ttatgatcat ttgtgttttt gtttttgggt agattaaaag ctttcattat ttttaatata 34380 
gatgcttttt aatacatcaa tgcactatat tatctaaatg atactttatt tcattttaaa 34440 
ttattaattt aatttaattt ataaatgtac aggtttatac ttcctaaaat catatcattt 34500 
ttttcattaa acaaaaaggt tgcagtaata agaagtttat taccagcagt gcaccagtat 34560 
tttattcaaa agcaggtgag gtttgtgaca ttaaatgtta ttagatgtca gacagtacat 34620 
ctttttctaa aactttgagt ttgtttccac tgctgaaaaa agaaaagcag tgcccagtat 34680 
aaacgaatgc actctttaaa gatttctctt aaaaatgaat accatctcag accccctagt 34740 
tttttcaaga tttcacgatc gctaaatcca acacaaaatc aagaaaaatt cgtaaatttt 34800 
agtgatcctg caaaactctt aattaaacgc aaaatgafctg caaaataaat aaataaataa 34860 
ataaatatat atatatatat atatatatat atatatatat atatatatat atatatatat 34920 
atacatatac atatatatat atatatatat atacatatat acatatctac atatatatat 34980 
atacacatat atatatatat atacatctta aaacatcaaa ttccanacca tataatcaca 35040 
ttatatttat tttcagttta ttattaattg tattacgtga ctaatagatg ttgttgcatg 35100 
cgcagcgcac atcatgtatt tgaggttctc ttgaaaaatg acgtttcacc tgcgtcctta 35160 
caatcattac attatatgag agtaggcaga attgttttgc agcctgtgca gtaagcagac 3 522 0 
acggatgaag cgccagtgat ttacatttga agtcaatgca aagactagat taggcatcct 35280 
gtggcacgaa ttagtctgtg catttaatgc acttcagact gaaaaaaaaa aaaaaaagag 35340 
caaacatcta acaaacagag caatggaaca gacagaagaa caagcagctt acaaagatgg 35400 
aggttgattc aaggatgacg gccgctggac gtgactgaat tgaaggacat tatttgtgct 35460 
ttacatctat ggctggtaat acgatgtgaa taaaatcgat aaatgttttt gtttgactct 35520 
tttcctgcag gtaccgagag aaaacgcttc cttgccattg atgagtttta cgttaatctg 35580 
tttttaaggt gtattacagt aggggaaacc ctaatgtatg tcctgagtga ggtacacaat 3 5640 
gtcagatgct ccggagagta aaatctgaga gaaagtaaga tcatggataa aaataagagt 35700 
tatacaacct tcctttgact taatccttac tgttttagat cttgtcttga caagagacca 35760 
aaataaagaa gctttatttt tatgatttgt gtcattcttg tgtcaatcat tgtcagattg 35820 
cacacctaac aaagtaggca acctaatatg gtaaatatat ataaatttat aaaaataact 35880 
atttttttat tgacaaattt ttcctaaaat ttctgggaat taccaccaca aaatatatat 35940 
attttctaaa aaaagtgaaa actagggggt ctgctattga aaaggattct atacaataag 36000 
atatttgtgt ccatacatta cagtagtcag tactaaagcc aaaacttact taagaaaata 36060 
ccttaagatt gatgggcaaa aataagtacc aactaatgaa tatcttaagg aaaatgtaaa 36120 
ataaatcttt aatacagaaa aatcaagaga accacaaaaa tatagaaaat tgtattgaaa 36180' 
tttttcagtt tgtcattttt ttgacaacat taaatttaaa tgcatttttt attcctaaaa 36240 
atgtcaggtg atcacactct tattttaata aatatttaat aactgtatta aatatttaac 36300 
tgttttgttt aaatggacca aaatacatac attttcaata tccacaagga catgtataaa 3 6360 
aattcatttt ctaaaggtgt tgtactcatt caaaggttac catcaaaatt ttggtaccca 36420 
acttttttca caattacttt ctgtttggag caagtggaaa atttaaccaa tgaccacatt 36480 
cagacttacc ttacacatgc cgtaatcggt gagtttgatg tgtccctctg aatccagcag 36540 
aacattgtcc agtttcaggt ccctgtaaat aatgccacgc tcatggaggt agttcaaggc 36600 
aagactgatc tctgcagagt aaaacctgcg agcggaaaga acaagacaca cagtaagtga 36660 
cctcattttt gtttctgttc ctgtggcgag gcgttcagac aacgttaaaa gtgttatgca 3 672 0 
ttggatgtcc tcagacctgg cgtgctcttc cggaagtttc ctctgccgct gcatgtggaa 36780 
catgagatcc cctccattca catactcgat tacaaagaac agtctgtgaa agaacagcat 36840 
taaataatca ttagccccta ttatcagaca aagacacagt tcacacttaa ctaaaaggct 3 6900 
caacaataaa tggcctaaag ggcacttata ttgaattttg ggtgtncact ttaaaaaaaa 3,6960 
aaaattttac tttttattct cactacatct tacacttttt atacatgtat tttaatgatg 37020 
tcttatgact aggctcacat ggaatctgcg tacgcagaac tctgcagatt ttccgcagat 37080 
ttttagccca tcattaattc tgttcattta cccgagtaaa tgtgtgtaaa tctatattta 37140 
tttagttttt taattaaata aagtaatatt attgactaat gtaaaaatgt tcatcttatt 37200 
tatgtacaat gcagttcgta aagtattatt ttctgtcttt tagtagatat attagatgag 37260 
agacttgctt tgtttaccaa ataaagtgaa tctaattgga tttgcatttt aaacattaaa 37320 
tcaaaggtaa aaagatataa ttttttattt tcacattaag gttttagtta taatactccc 37380 
aaaatcattc cgcagaaatc cgtagatttt taccaaaatt ctctgcaaaa' atagcagaaa 37440 
atgttagcag attcagtgtg gcccttctta tgactaaatg aacattgatg tggcaaaaaa 37500 
aaaacactct tcaaggctgc atttatttca gttgtagtaa tagtattttt tcatctttaa 37560 
tgcatttgta aatgtcattt tttcctatga tgacaaggca gaatttccaa cagccattaa 3762 0 
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atccagtatt aagtgtcaca aggagtttca caaataatcc tattatgctg aaccatatcc 37680 
tatggtgcag cacaagtgta tttcttattt tcaacaataa aaaccgttta gttgcttaat 37740 
attttagaga aagctgggat gaacttatga ccatgaatgt tgttctaaaa tatgcttgtg 37800 
aaaaatatta taaaaagatt ttcagaagta taaatctgta aaattgacta aaaataggaa 3 7860 
taaattaatt tgaataaaaa tatttttttg ctaattagta atattcacaa tattatattc 37920 
gtttttacaa tgacaatcaa gaaacaaaaa cacagtttac acgaatttca taatataaaa 37980 
ggaaatgaaa tacaatctgt atcatatcat cagtaaatac aaatacacaa attataatta 38 040 
aatacattac aatactttga aaaatataaa ggcgggtaaa acaatctaac ttaaacataa 3 8100 
tataaactaa ttctaaatta acttactgac tttaactaaa ttattagctc ttttttaatg 38160 
gaatgctttt aattaactat agctaattat taatttaatg accatggaat gactacaccc 3 8220 
cagtcgaaaa ttatttttga acatgtctca ttgcttttag ctatatattt tggtacattt 38280 
aaacaaaacc attttattaa ataataattt aaaaaaagag tgtgcttacc taacatcttt 3 8340 
aggaataaaa tatgcttgtg aaaaatatga tattaatatt cacaagtata aatatatata 38400 
agtgtaaatt aataaattaa tgtcaataaa actattattt tgctgattta gtaatattca 38460 
caatattaat gagtttaacg atgacaatct agaaatgaaa atacagttta caaacgtatt 38520 
caattaattt agtgtcatat tataagatga aattaaatat tatgcattta aatacatctg 3 8580 
tattacaaaa ctataaaaac ttttgatcta cacaaataca aaaacacata ataataaaat 3 8640 
acgttacaat actttgaaaa atatgaagac gggtaaaaca atctaactaa aacatagtat 38700 
aaaataattc agttcaaata tatagcatct catttaaaaa aaaaaaagtg ttaataaatc 38760 
aataattatt cagtcatagt taaagatgct gtttgtaagt tgttgactca ttttgaagca 38820 
taaaaaaaaa taaaatgttt gcagatattt aagaaaatgc caagtaaaca ttcttgttta 38880 
tctgaaaaac aatgctgaag tcagatattc tgcttaaaaa tgtgagttac gtgccggaac 38940 
gcctgtcttt gttttgctcc tataacccgc ccaatgccag atgagccaat aacattccag 39000 
caccctgggt tgccttggtg gaaaaccgca tatttcattc attcattcag aaaggatctc 39060 
aaagcatgcg cccgtgaccg aaatgcgacc tccggtggac agtagaagac tccgaaatga 39120 
gacgcagatt cagagttcta tatgaggtgg ttattaatta gcaaataata taaacactac 39180 
gaccataaac attaggtgag cagattacat tgtaacccta tggtccaaca acccactacg 39240 
tgaagagatt tgcaaagata agcaatttgg ctctttgcac aagacaaaca cgacagaaat 39300 
ttaaatacag ccattcagaa acacagaata tgcactcaca caagaaatgg taaggtttat 39360 
catctaatta atacatatta aacctcttta acattattaa atgtagaagc tgagtcattg 39420 
acatgtgttg gttttgagtc acagttcaaa gttcaatttc aagcggctta ttttattttc 39480 
gagatctgag gtgaacaatc tgctgctgct ttcaatagta tggcaataaa tgtcatttaa 39540 
aatggtattc aaattcacat tattagtatt taacactaaa taaagcacat gaggtgtacc 3 9600 
tgaggagatc actgtcagct ttcagttgtt cagctatgag aaatagaagt tgtttctaat 39660 
atatcaattc aaggtttggt taggacaaaa acttttaaca ttgtaaatat tccttctatt 39720 
gcgtgccgtt tctttcatat acaacacttg ctcctctttg tctttaatct ggcaacatgc 39780 
gtttgcattt gttttgatcc aggaatgcat tacctagttc aaccacttgg tgtcaaactt 39840 
acaaactgca cctttaatca aaggtgttcc ataaaaaagg gctaataatt tagttaaaga 3 9900 
aaacaattga gagcttaact gcataaattg taaaatagaa ttaatatgaa cccatgttta 39960 
aaacaatcgt gaataacata acaatatatt tgatttgatg atctaaaaat gtgtgcagag 40020 
atggacagac ctgctctccg tctggaagca ggagtgaagt cccacaagga agggatggtt 40080 
tgaagcctgc tcaaacacat gcttttcagt ctgaacccag tcaatatcct gaaaaacaag 40140 
aaagaagaac aaacatcaca cacacacaca cacacacaca caaacacaaa cacacaaaat 40200 
caatctccca ttgaaaagca gagcattctt tcagtcagta ggggaaacat ttggaaggac 40260 
attcaggaca gtattgagca gaggctcctc gcaaggctga aaactgataa aggttttaaa 40320 
agacgggggc accaggtctt cccgagaagc tgctcctgta tgtgaggaga ctcataatgg 40380 
acagatacag agtgaaaaac taagaccttt cattgacagc ttggtaaaaa tccctctgac 40440 
cgtgagactt gcataaagaa gcactgaggg gccagtagag cacagctgcc attattgacc 40500 
atcggcttct gctgtcagtt catcacacaa gccttaagtg cggctaatca tcgttaaatg 40560 
gtcacttcat caagaccctg tctgtttgtg tgcacttcaa cacattctgg aaagcttata 40620 
tccaacgacg cctctttgct taatttattc agcatagtca tgacttattt tgcttttgtc 40680 
aggagaaata ttttatttag taaatttcat gaagtctgtg tagaatgtgg tttatttgaa 40740 
ccaattatga ttcacaatta gagctgtaaa tttcatttaa acggtagggg gccaagaaat 40800 
aatacaatgt aaagcaatgc acagctaaac aatatgctaa ttgtggcctt taaaattata 40860 
cccagtcgtc acaaatataa caatgcatgg gaaatggctt tggcatgtta gttgcagtgc 40920 
actacgcatc acactatagt aaacaagcta atgttaacta gataaataat attttaatcg 40980 
ctaatacagt ggattcaggg aaaactacat cagtaatcag catttttgca accaagttat 41040 
gaacgagtct gcatcaactc tattgatgaa aatcgcttta ttttaagtga ctaaaatacc 41100 
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ctacttactg gagtccaaca tadactaggc cccagccaca aacctgactc gtgtgtgtgt 41160 
gtgtggtgtg tgtgtgtgtg tgtgtgtttt tgtgtgtgtg aattagtata gacgaactgg 4122 0 
gaaagcatgc agtggtccaa accactgtga gcaaggggag ggggaactca acggtttata 412 80 
aacgcactct ttgtgtttgt tttcttactt atgcaattat tttaggtata catacatata 41340 
tttttcacaa tagagagtaa gatttaaaaa aaaaatgttt tggggacaac cctcagatgg 41400 
aagacatctc tccgtgtttg attttcttat acatacgatt ttcttatata cggttatgct 41460 
gtcggactgt tgtataaacg caacatcaca cttttagcag tgtgatgcgg ctgtatattg 41520 
tcacttgtgg gacacaaagg cattcgggct gtggactcgt gccaacgcac gcctcccacc 41580 
agtgccgata tacagctaca tcgcactgct actcgtgtga tattgcttaa tataatttca 41640 
acagatgact aaagattggc aattatttta gaaacaaatg taagtgtatt tgcttaaaat 41700 
ataacaaaca aatttccagt aaagataaaa acaaaagatc aaaatttcaa aagaaataaa 41760 
cactgctttt ctggttctgt tttccttctg tatagtctga ccatgcaaaa ctaaatgtaa 4182 0 
ataaaatact gcacagtcat cactgtataa attaaataca attaatattt gtaaagctgc 41880 
aaatgttggt gtttcttgtg catgaatgct ctcttgttgt gtttaagcct ttggttcaat 41940 
ccaactttgg attgattttg cagacacaga caaattttca aattctgaga caaaaccctc 42000 
cagattgcag ctaaatgagg aaatcgtaga tctctctctt ggctgtgact gcttcttgac 42 060 
caaatgcgct tttcccctgc atggtacatc ttggctcggc ctggttctgt tcggttcagt 4212 0 
gcggcttgaa tcttctcgct tttctttttc actgcagttt gatatcgctt taattggtgc 42180 
gattacagtc atatcatcat agctgtgccg tctgcactgc attgccatga cattctgaat 42240 
cagcccattc agctctgaca ggcactcatt tatgatgctg tgacaatcat tactttcaat 42300 
tttacattgt atgtcattat tgttcccctt gtgtgtgagc atgtgtgaat gtaaaaaagc 42360 
gagaggaagc tttcgtcttg tgcagaagcg aaatcaccac ttgatttttt cgctgcttgt 42420 
tgtctttaca ttttcatata gtttggcatg tatgtgtatt ttaggattat ttcggattaa 42480 
agcccaatac tattctattt gtgtacccct accttttggc ccttgaaacg gagtgcaata 42540 
tgtcattgcg agctcatatg agatcgatga tcgcactgct gtagttattc cagttgcact 42600 
atattttggt atttatcttc aggaaatcgc caaaggcaat aatatcatgt tgtcataaca 42660 
atataatgtg gcaataagct cgtaactgta ctgtgcattt acactgtggc catattcatc 4272 0 
tatgcaaaag aaatcaacat taacgttcta ccagacactg taaaaaggtc attcccagcc 42780 
gttagacttt tctgacaggg tattccgagt gtcatcaagc attagatgtg gaaagtatgc 42840 
tgtgggacag ctatacagga gctattatta tggattctag attttatcgt ttattttttt 42900 
aaaacatcat aaaacatgaa tgccattatg cttgttttaa cacaaaaaca cactcattta 42960 
tgtattatca atgaattaag ccaggaacaa agatggcggc ttgcagttgg aacgtgcaaa 43 02 0 
cagtgctaat aatcactgcg ttttatagtt tgctttctcc agttaaggag cccattttac 43080 
atttttaata caggacagtg atatggtgaa gagacatttt ggtaggagag caaatattgt 43140 
ttaagagtcg tttatatcta ccgtgtacta agaaaaacaa acgtaaaaaa aaaaaaaccc 4320 0 
aacaaagctg tgatggatag ggactaagcc gaaaaggaaa ttaatgatta aatatatata 43260 
tatgtatttg tgaataattt tgttcctgtt ttaaaatgtg gaggtactgt ttattttata 43320 
catcgactgt attttgaagc catactttat ttaccagcga ggcccacaat ggaaacaacc 433 80 
cctgggcttt attgtgtttt tatcatcaac agtttaatta aaaaatttag gggctacttc 43440 
tatttttgta cagcttgtct aaaatctgtc aaataaaaat tcaatttaaa attcaattca 43500 
attcaatctc ctgacttctg tgtgcagcca tgctgctgtt gtagctggtg tattccttgg 43560 
aaattttcgt acccctcggt ttcgagtgtg gtcctggaaa atctctgttt caaggactat 43620 
agtccttact cttagtccaa tgccttcaag ctaaagagaa ttgaggcagg gaagggaaag 43680 
ggctaagggg ttgaattggg attagcctaa gtgcctggag tctttttatc tcacttcaat 43740 
atagaagtga tttacaatat agaagacgtt tcacataaag aagtgataag tgagctgtag 43 80 0 
caaaccaaac tgtggcataa aattttctat gacaagaaaa aaaaacacaa catacaggta 43 860 
gttcacaagt tctctaatgc ttgcttttaa aggtcaggtg ttattcatta tccaaaatca 43920 
tgctaatgat ataacagcgg cattgtttgc aagttcctat ttttatgaaa tgtgaaaagt 43980 
cacctcgtca tcgttgacca gctccttctt caccaccttc atggcataga tgcgttctgt 44040 
ctttttgaga cgcaccagca gaactttggc gtagctgccc ctgccaatca ctcgcagcag 44100 
gtcaaagtct attagaccca aactggacac cgcttttcct gattcccgac tgcccacagc 44160 
ctgaaggatg catgaacgga gtattaaacc agtatcatac aatcatacat ttctgcttcc 44220 
tggtttagat gttcttctct gtcctctctc tttgtttttc ccttgagccg cttgatttcc 44280 
tatgactgcc acagagctac agagctcagg cccttaatca gatgctaatg accctgatgg 44340 
aaggttactt tattagctgt acgaaaggaa atcaatagag ccgaatggac cttgtgcaca 44400 
cccccatacc actgcatgag tgacaagtcc agcctaatca tcagtctgag agcgactgct 44460 
gaaagctcat ccaatcacac actggtcata atagccacat ttacacctgc aactagcttt 4452 0 
tatttagact gatttcatga gcaggtggag agtgttgaag acaaaccgaa ttttacaaac 44580 
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cacatttaaa aacattaaat aagcaagcag tcatttaaag aaaaaaaaaa tacaattgta 44640 
agtcaacatt aacaactatt gctattatta ttttattgtg gtttaaatta taaacatcca 44700 
acttaaaaat aataaataaa aatcattgat attctaatat aaaaatgtct tgatattaat 44760 
acaaatgtat tcttacaata ctaatattta tacaaacatt atattttccc aaaaactcat 44820 
tttttatgat atatttccaa ataaatcttt atccaaaggg aagcattgaa gattagttgg 44880 
tgttgtatac agaaatatac atacaagttg acattttaca aatctaaatg acatctttta 44940 
cattctactg acccccaagt ttgaatgcga ggaacccctc attttaagct tattataaat 45000 
gggtattatt tggaaaagat ttagcttcta ttcagttact ttattaccat ttaacactta 45060 
tagtactgca tttatttatt taatcacata tttgtttcct tattatttaa ttttctgtat 45120 
tattattgtt aattttatct atatgttaat aatatttagc acatttaact tgatgtgtta 45180 
atgaattagt atattaatga tttattattt atatataaaa tctacagata taaataatag 45240 
atataactac tacaacgtat taatataagg gtaacactac aataagggtt gtattttcat 45300 
tattgttagc taatccaatt actaacataa aaaattacaa aacactaagc atcacagtat 453 6 0 
tttttgtgtt agttaatgtt aaagaaaata catttgttta ttgtgagttt atgttactct 4542 0 
tgcagtgcat taattaatgt taacaagcat gaatttagat tttaataatg cattaataaa 45480 
tgctgaacta tggttaataa ataccgtcca agtatactta gttaatttta gcaaatacat 45540 
taacgaataa atgcttactg taaagtctga ccataattat tatatatatt gtaatcaact 45600 
ttaggattat taattttatt catcttaata aaaggctagg taatttagta gaaacgtgtg 45660 
gttttatttg aataattaaa tgaagaattc tttttaaaca tggaccccct ggaaaaccct 4572 0 
tttgacaccc tgagggttca caaaactgag atggacatct taggcagcag tattttactt 45780 
gtttactagt aaatgcatta gctgagatga tgtagatgcc aggagcaaag catgtcttac 45840 
ctcatgctcc tctccctcat gattgatgct ttctgttgag ttctttttgc ccagaactga 45900 
aacacaacaa agagaaacaa gaaaagtcat tgctcaatgc cctgcagaca ttattcattt 45960 
atacacaatt attcgcaagc cgctaaacca aactgggcac tttgataaag cacaaaacaa 46020 
caaggaattg aaatgtatct gaccagcact gtaactaaag ccacaagctt gttcccgtca 46080 
ttaattccaa caaacaagct taatctagtg agaagactca cgcctgtggg ctgctcaatg 46140 
catcactctg tttgagtgcg agcacaccta attactaaga ggaaacatca gcttaaccca 46200 
tccagtcatt tcaaatccct caagatgaaa atcagtggcg tcaatgagcc actatagatc 46260 
ttactgaatc aatttagcca cttaatgaat gcaataactc agatgagcat tgtttaacac 46320 
ctctgcatta caagcttctt ttttagtcaa attgctgaca ataaaacaaa gtctgattga 46380 
ctttgtatgt ttgcaaactg atctgttggt aaatgtttat cttaaaaagg ttgcaatcag 46440 
aatatgcaca ttttacaatt taggttcagg aaagattgaa cttaagttgc aaaatatgaa 46500 
tactgaactg tttaaattac taaacaatct aaaagttgat caaataaata cattgtttat 46560 
tttatcagat atacgaaaaa tctgaacaag gtacaagagg ctgattattc attgacccga 46620 
ggcagaatag ttagaactgt ttgaaagata aaataaatct caaaaaggca ttctttcaaa 46680 
aaaatgatta gtctgcaatt caaattcaag taagaaagaa aaatctgata gacccgatat 46740 
agaatacata tatgacaaag gcaaatatag atgagagcct gtcgtacctt gatctgggtg 46 8 00 
ctctggatga gtcgaccctg gatcgattct tccgatcatt gggtcctaga agaatagtaa 46860 
gaaggtcaag gacagataga ctgagattca gagtattatt agcagtaata gcatctaatg 46920 
taaaggtctt gaactagaaa aggaatgtga attgtgaata ggctctcttt agaaagtgct 46980 
taatccaaca tgaattatgg attactttta gctttctttt tttaaagtaa taatgtaaat 47040 
aatttcagca aaaaatcagc tttaatgaag aaaaaaaagt cacatacata aaatcaaata 47100 
tcaatataca taaatcaaat gaacaacaaa ttttaatttt agatgaacaa tttaagtaaa 47160 
atgtgaaaac aaaagtgtga aatttgtgaa attgcattta aatctctcgg atgtfcttgtt 47220 
gcaatcctga gctttataac caaccacatt ggttggtttc acttacatgt ttgtttgcca 47280 
tttcagattt tttttaaatg cattttttaa agagtcattg agatatacat attactgcta 47340 
aaaatactac aggtaaaata aaaccattat ttatgtccgt gcacaactaa aaaataaaca 47400 
aatttaagtc catgaatgaa gaaagggcaa tttatagatt ttttttctta tcgcgattac 47460 
actactcata aatgttgttt gaatttgcca catacacgca ttgatcgaaa catccatttt 47520 
attgtttact gagttcttat tcattttgct gtaaaaaata aacacactaa aaaggaatta 47580 
tgattgtatt tttaatagga aactttggat ttctagtgat ggttattatt ttttaaactg 47640 
caaaatataa ttgaagctag acacctggag agccacatat ttttggtcaa atagccatat 47700 
gtggctagcg agccataggt tccctaccac tgttgaaaac agtttgttaa tcacttttta 47760 
ttgttcattt ttaattgcta atagttttaa ccatagaaac cttaaattgt aacttcattg 47820 
acactaatta cattattatt tttattttaa gccttaaaag ttaccctagt aatattccac 47880 
tcattttttc aatgagcatt acaatgagtg gcctataaag ggttaattaa ttgaaggatt 47940 
tgacacaatt gataagatca tgataataaa agctcaatgg aaatcaacag agaaggcaaa 48000 
tggtgcaaac tattagaaaa gcaaacttct gaccatgcga atgcctcacc tgtattacct 48060 
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gtctaccaca ttctactgtg accagcttat ggcatttctt atgcaccaga agcttacagt 48120 
tgatacactt gtatccctgc ctgcccagac cccagatacg atctgtgcag atggcacaat 48180 
gagcacgctg tacaaaacaa agtgacaggc aaattaaagc aaacacaaca tactacagag 48240 
aaataaaaaa ataaaaataa aaaaaataaa tatatatata tatatatata tatatatata 483 00 
tatatatata tatatatata tatatatata tatataataa gagcatatac cctgttaaag 48360 
cgtttggcct gaaacgcatg tccagtggca tagtagagtt tcctccaacg tcgagctccc 48420 
cgccggtata tagactctga gtacaacaag tcaataaaat ctatgcacca cttacacaga 48480 
atcaatgcag ccacaaaatg tatttttttt ttaaatatga tataaatatc agaaaacaca 48540 
catactgtct tctccaggac agggcatgcc aggtttttca gggacacaag gaaacactgt 48600 
ggagagagag agagattgtg aaaaaacagc tcctgcatga cccctctgtg acctctcctc 48660 
gtctgacagc ggataaggac tttgggtcaa caaacagagt ctgtccaggc ctacaaggtc 48720 
agccggtgtg gacagtgtct attgacggac agtcggtggg ggtgtttttc tgaaagccct 48780 
tgtggccagc cgtcgtctcc tcagcataac taatgaactc acccagcagg cagaagaaca 48840 
atgcatctct gtcaattagg cttgcatgat acttgtgata ttgtaaaatg tgatattgtt 48900 
gagttttgcg ataacaatat ttcctgtgat ataacattta ccaagagaaa ttttatttta 48960 
ttagctatta tttaaaatca tttatgtaat gatcctagta catatctatt tgtaattcct 49020 
gtttaaagca aataactatc tgtatattct gttcatatct atctgcatgt accgaattat 49080 
taatgaaaac ctgttcagta tgttaatcta taagtaaatc tcttattata gttaagaaaa 49140 
cttatatatt atgttcacag tggatccatc tgtaaaaatt acccatagtt ttctatagtt 49200 
gcactcataa tgtctaccta tatcctggac tcctggatgg acctaaaccg catttcgttg 492 60 
ccttgtactt gtgtaatgac aataaagttg aatataatct aatcaaatct acgaatataa 4932 0 
acaggtataa ttaacctagt ttagggggaa aaaacaagct atgaacattt agactttaaa 49380 
acaatgtgaa gactataaac cttaacatga catttttttc ttattgccat ttccctgctt 49440 
ctgatattag tatttatttt tcaaatctct gctctgaggc actagattca cctttggtaa 49500 
taatagcatc tactgggtac aaaaataatt acaggtataa aaaagataaa ccctatataa 49560 
ataacctttg catgcaacac aaatgcttgg aacataaaat aaagtgaatt taagtcgcac 49620 
ctctgtgcaa tacagatatt gcacatacga ttatcgcaat aacaataatt tttcagtata 49680 
agcagcccta ctgtcaaaca ttacccgaat actttctcat ttgtccaata agcatctgaa 49740 
gacctgtatg actctgtagg gcttgttatg aataaataaa caaacacata tacccgagcc 49800 
ataacatcaa aatgtggagc ttcatcagag ctcatcacta gggtcttgac atgcttgaaa 49860 
gaatgcgtgt tattatgact catgtaaaaa taaactgttg atggaaacat taagatgtgc 49920 
ataaatggaa atcataaact gagtaaaaaa tgcacaaaaa ctatgatgga aacactttta 49980 
ccaaacaaat tccactatac gaataaaaaa aaaaaagtca tgtgattttc ttttaagaga 50040 
tcatgtgatg gtaaaaatgt gtgtgaatgg ataaaccagc aggttaagca cattgtaaaa 50100 
catctgaaat gttgttttgg tcattctaaa acgccttaac catttcagca tccacgtgct 50160 
ccatgtctca cgccttcaaa cggcaccaca tgttcactgc aagttgggat tgccttctga 5022 0 
ggtgcaagtc atttattaaa taaagaaaac aattacgcaa aaaattccat ttttactgtt 50280 
gatatttggc agcagttaat cagacagtga tgattttgtt ctctttgact ctgcgttatt 50340 
tgcatgtact ttatgcaata ttccagtttc gcgcatacat ttaatttgca tctttggatg 50400 
taaacatagc taatgacagc aaaagttgcc aattaaaatg cgaagaacac aaataattct 50460 
aaagccaaaa ggtcaactag aaaaccttct cattaagatt gtgttacttt tttcacattg 50520 
ttcaggttaa ggtgcataca aacaatcaga aaaactatat gcttgttact atatttgata 50580 
gtgcattgaa ataatctcag ctttcaaatt atgataataa tgaactataa taaagtttat 50640 
aaactggtaa aagtacagaa aacaccatta acaataactc caattggttt atttcaatgg 50700 
aacpttttca taaacctgtt atgaggcatt aaacagttgg gacaggacag caagtttact 50760 
gttattctca cacaattgtt ttttcctttt tctgaaagtt tttataccat catcgtggag 50820 
tttttctttt attatttgag caaataagat tttttttcca aacttttttt gtattctcct 50880 
attctctgcg ctctaagttt ttccaattat gatgacttct gctactgaga aacctggaaa 50940 
tgtgaaaagg gtctattgtt tctctctttg atttggtctc tggttaattt cttctgttca 51000 
gtctgtcatg atcagacatc cacacagagt tgcttaacag cagaaactga agagtagctc 51060 
cacttaaaaa caacaatgat tgtttatcat gtttgttaca gtaaaacagt ttgatttgat 51120 
aaaatctact ctattaagtg ctatttacta gggctgcaca acattggaaa aatataacaa 51180 
aattttgtta ttctccaatt atatattgca atatgaatac aatttctcca gatgacttta 51240 
tcattttagt taattgtttg ggataattct gcaggtgcat ttgcaaaaaa cctaagaaag 51300 
atctctacaa gcataaaaaa atgcaataaa aaaaattata taaactgtta ttcattgttt 513 60 
tcctgaatac taacagtaga cataataata ctgcatagtc tttgttgtat aaataataca 5142 0 
ataaaattat tggtatttaa tttaaaattg gttcaaattc ttatgcctga atgtttttaa 51480 
atccctttaa cggtcacatg cctcaaatat gaatgcaaat tatgaattat aatccaaaat 51540 
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caacattgca tatcatgcat tgttattgta tattgtacag ccctactata tacaacaaac 51600 
aaaggtaatg taactgatgt atcaaattac atctatgtct attttcttat tgcgaagaaa 5156 0 
accaaactac atatttacat tctgatctaa acagtgtagg atgatgacgg gatgttctct 51720 
gctcagttta ctgctgctct gggcatttaa atttttttcc agaacaaaaa aagaactttg 51780 
ctgcgagtac atcggggcct taagacctgg tgtaaaactt catccctcat aattctttca 51840 
agaatcgaga gagcacaaaa gcaaacgcac tctagctaac ccacatcaaa agccaaatcc 51900 
cagcataccg tgaataatga gctccgagtc tttgttgagt tcatatagac gcaaggcctc 51960 
ctccagctcc agctgagaag aaacggtgca cggatcccct gaaatgacaa gcacagagaa 52020 
cgctcaaaca caactctcca ccaaagacat cagcataact gacattgcga agatacaaac 52080 
gaggcgagtg agggagaatt taagcaggcc tgagaaagag aaaagaaacc gagcgctctg 52140 
atcatatgga gcgtaaagaa aatatctgtt cgattgacat cataaagccg taacacggcc 52200 
catggtgttg tacgaagctg acgtcaatgc tgttgtctga agagtagcct gtggcaccaa 52260 
cttcttgcac agacgcacac ggctttcaat caaacatcac agagaaggca atgagcgacg 52320 
gcgggcatcg ctgccaaggc ttgcgcagac agacgggatt aacttctcgc catgacggcc 52380 
cgctcagatc agctatgcta atgagcagcg tgactctggg ctcagcgtgg cacggtgaag 52440 
attttggtat ttaaatgaat tttcttccat ctccatttcc catctcgcat gactaatgca 52500 
gcagttaatc atcgcgcgca gtcaatcttt ccagcataaa accccgctcc ctgctgtgtg 52560 
ttgcattcag attcgcatta ttcctcatgt aaaagccatc tattttagtt ctgccggtgg 5262 0 
agtccagggt aattcggaga ctttaatgca atctgccaaa agtggcttca tttgtaatat 52680 
ataaagtgct ggtgcaaatg aggcaggcgc ttcaacagaa tggcaggagg aagcttgttg 52740 
ctacggcgac agttaaagaa acagaaggaa aggccacgtt tacacctggc attaacatct 52800 
gtctcaaggg atttgatcaa gcggacagca ctaaatgcag gttcaaataa agtgcaaaag 52860 
attttgtgat gactcgaatc ccatttgtgg ttagttaaac acacacactt tatttatatc 52920 
atcaatgctg atgcatccag gacatacaac acaaaattac atattgcata agatgcaata 52980 
acatattaaa afcatgataat aacaacaatt tagtatgttt aatgtctaaa ttgaatgaac 53040 
tgtcttgaaa gaattcagac atgatctact gggattttca cacacaacca tttctagggt 53100 
ttatagagaa gggtctgaaa aagaaaaaac atctagtgag cagctgtttt gtgggtgcaa 53160 
atgccttgtt gatgccagag gagaatggcc agactggttt aagctgatag aaaggcaaca 53220 
gaactcaaat gaccagtcat tataactgag gtatgcagaa gaacatctct gaacgcacaa 53280 
cacgtcaaat ctgaggcaga tgggctacag cagcagaaga ctacactgag tgccactcct 53340 
attagctacg aacaggaaac tgaggctaaa atttgcacag gctcatcaat atagtttgga 53400 
aaaatgttgc ctgctctgag ttttgatttc tgctgcaaca ttcagatggt agggtcagaa 53460 
tttggtgtca acagcatgaa agcatgtatc catcctgcct tgtatcaacg attcaggctg 53520 
gtggtggggg tgtaatgtgt gggggatatt ttcttggcac actttgggcc cattagtacc 53580 
aatcgagcat cgtatgaaca ccacagccta tctgagtatt gctatgttta atacctaaat 53640 
tgaatcaact aggagcaagc aaaatcatgt cttgaattaa ttctggcact aatatggccg 53700 
ttcagcatta gtataacatc gaaaagtgca ttttattgtt aaacagtaaa ataaacatta 53760 
aactattcat gaaggtgacc cttattctca agcaactaag acgatagcct ataaaacagc 53820 
tcatttagaa aaatgggtca attaagtgta atagccaaaa ttcagtttta acaagtaaaa 53880 
gctctctttg gtttaacaag ctgtgtgtaa tcatgaaaag agagagaggg tctctttagt 53 940 
gaaacctgat ctaaagaagt cactagagat tcatgttaac caggagtaaa tgtcaatctg 54000 
tctcgactga ctgagatcac ctgagacaga tgccaacacc tactgtgaat agagcaaaga 54060 
gacacaaaaa tgaaacaaca aaccttcctc atcaatccat ttcatggtga agagctggtc 5412 0 
attgtccatg gagcacatat cacgcacctc attgcagagt ccctcatagg agatcgaagg 5418 0 
ctcaaaatgt gtgatcatga tgtccctaaa acagaaaaaa aagtgacatt cattaaaatc 54240 
ctacatttta gggctgcagg atattagaaa aatctgacaa tgcaatgttt tgtttttctg 54300 
tgatgtgtat ataatttccc cagatgactt taaatacctc gatttggtat gaaaccatat 54360 
acagaattta atcaaatgta aaatagcagt gcttagattt cattgcataa ataaatcaat 5442 0 
tgaatatttt ttattaaaga ctggtacaat tttcatgcct gaatgcttta aactctcttg 54480 
atatgcttta atcacaggcc tgaaaaacac atgcaaataa aaaacttttg ctttattatg 54540 
gaataaatct tcaataactg tggctcatgg taaactatat tcacaaattc aacattgcag 54600 
atcctgcaat ctgactacta cagatgcaca tatttcaata tcaatgttaa aacaatatat 54660 
tgtgcagccc tactcttgaa aaattaaggt ggggcgagag gaaaaattta tttaatcaat 54720 
tcatagttct ctaatatata ttagggatgc tctgatcaat cggctggaga tcagatttga 54780 
tcggtcctca cttatctggc tgatcacatg aaccaaactt aatgttagat tacaggttta 54840 
caagcagagt aatttacatt ataacttatt atgcataaaa aatgcacttc gaactttttc 54900 
ttcattgagg catgttgact tattcttcca tcatttgttt ccaaattgca ttgtcagtca 54960 
tggtacccaa ccccacccct aaacacaact gtcattgggg gatgagtaaa ttgtacttaa 55020 
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ctgtaccagt gagattgtaa gaattcccac cccatccact aaatcaaaaa cttctgaatt 55 080 
gtctagagat agcattggtt attcatgttt ctatccgcat ccctgtaaca gcatgcaata 55140 
ttgtgcttat ataaagtatg aaaaagtacg actgtataca atataattta ttaatccaaa 55200 
aataaacaat aaagggtaac attttttatt ttatgaccat tcgatatata tctataccat 55260 
cataatgcac agccctccta aaaatatgat aaataattta gtgtgggata ataagagaaa 55320 
tctaaatact atattagtac attttaagac caaaaacaca gtacatggag tatcagagat 55380 
ggcgagtttc aaggaagctg taatactagc atattcccat agcaggaggc acacagcagc 55440 
ttcttactct ctgttgtcta ccgagttcac actgttttgg ttaattatta acagcaggga 55500 
gtctttcctg tctacgagcc agtgcactga cccacacatg cctgcgagcg gccactgcaa 55560 
ccatcccaca catcacctta tgacgaacca aaaaaaaaag tgctgctttt atctgatcca 5562 0 
ggactattac gatcaagcta ctgtacgaga cataaaaggc agttatgcat gtcctgtgac 55680 
aaatatctct gagattataa tagctccgca ttactaaact actagtgttt agttcatatt 55740 
tacagagaga gacagattta ctcttatcat tgcataaacg atataacaat attctggcat 55800 
ataactgaag gttttgtgcc cactatctta gtagtgagag atgcatcctg tgaaagcgca 55860 
gcatctctga aaggaagcag caaagccaca ttcctctgta gggcatacct ggatcagtac 55 92 0 
acactgcttt tacaacaaca agcatgatga gcaaacattc agtcccatgg tcaaatacga 55980 
cttacaagaa actcaaacac gattttctga acctgaattt gtaaggaagg cttacgggga 56040 
ctgaagtcaa ggcactatac taggaatagc acacaaggaa atatgaccat tcctttaaag 56100 
caaacaatag gtgtattaaa gggaaagttc gctgaaatga tgtcatcatt taaggtgctt 56160 
tcatactagt ttagctcata ctttgtttta acttcaaaca atttatagtt ctgatgagct 56220 
ttaagaatgt caggtgacac tacgttaagc aaacaatgat cattaaatga caccacactt 56280 
tttaaaaata actataacac attcaaaata tgaccttaaa ctttccagca tcttttaaaa 56340 
aaaaacacaa ttcaatatat aacttccaaa atggtaaata tgcattctgt ggcacttttt 56400 
accaatgcta acattgcagc attttcgact ctggtattaa actatacctg gcaacatcat 56460 
gactacaatt aaaaaaacaa gtgtttgctg cacattaaga ctgccagatt ttacattcat 5652 0 
ttaagttata ctgcaccaac agcagtggtg gttcagatga cataatagtt tttgagtcac 56580 
ggatcggacc atttttcgga tcagcaaaaa aaggaaggag acaaatgtca attgctttca 56640 
attcatgcaa aaatataagt gcaaaagcca ctggttttaa aaaacagaac ttagaacctt 56700 
taattttcat aaaaatctca atttaaaaac aaccggctga aaaaaaaagg aaaattttga 56760 
atacgaaaaa tgatctttgc tactttagct gatgatgcta aatatataaa tctaacatat 56820 
tgtacagaaa atcatcttta ttttcaatta atgattcaac aattcacact taaagtttca 56880 
tgtttcatta aagttttatc atttttgaaa accggatttc agtctttacc ataaacatca 56940 
gtgtctttat ctgaactata aactcttctc ctgtgttatt taattgcttg taatgaactg 57000 
aaagtgtaaa aactgaatct ctctcggtca aacacagatc tgaatgcagc actttacact 57060 
gaatgcatta attcaggcta aacagtgaca gaaaacactt ttaattaata acctaagaaa 57120 
gcatttacgt aggtcccacc acaaccgtaa tccataatca ttatatttca caaggaagcc 57180 
atgatgcttt catacatgtg atttgaatga cactcgaggc aatctgtctg caattgttat 57240 
aaatgttgga ttaacctaca agtttaaaaa gttaataatc cgtcattaaa catgtgcgtt 57300 
ccgaactgtg ggttgtgatc tgtatggatc acgaacaaac cgtgatcggt tacaccccta 57360 
accaacaggg ggacaataaa tattttttta ttttaatttc aaattaaaag gagctgtcaa 57420 
aaggtggaaa aaaaacatat tgaccaatca atacagattt tttacatttt ttaaatgtcc 57480 
aacccatctc attgttagac atgnnnnnnn nnimnnrmnn imnnnnnnnn nnniumnnnn 57540 
nnnnnnnnnn nixnnimniiiin. nnnnnnnnnn nnimrmnnnn nnrmnnnnnn nnnnnnimnn 5760 0 
atctctgttt tttattatat atatcctggg gtatccctfcc aagcaagaaa aacaggggac 57660 
taaaatgtag atctatctgc agaattttca ttatctcaat ctttacccct tgccaaaatg 57720 
cacgtatctt cggacattcc caaaatatat gtgcgtgatt ataatgagct gtttttaatg 57780 
tctaaaaatg aatggaggta aataagacct gaagtctcga gtcaaaaaga ttcaaatggc 57840 
tgcaaccgct tgttcatgaa gaataaggta aataaatgca ctcatgcaac atggattttt 57900 
tgctttaaat gctattacag tatttaatgt taattacatg aacaaataaa actttacaat 57960 
taaagtgtta ctataaacat attaaaattg gaattattaa aattaatatt aacaatttag 58020 
tgttctcaca aattagagat tggtaatgca tataaaaaaa -tcaagaaaaa gtttaaagaa 58080 
aataatttta agtgacaatc taaactattg cagacagaga aggggccacg agccaagcac 58140 
atctctactg tggaaaatgt tttagcttaa tagttgcaat ataggtcaaa tctgaaatgc 58200 
ttattacaga catactacat gtgaccgaga gaacagaaaa ggatcatgca tgagtgatag 58260 
aaagagcaat aaaaggaaga actaaggcaa gaggaggaga ataaagactg aaattcatct 58320 
tagccaggct tcaggggctt tttaatcaga atataaaagg tgcagagagt aaaatgtcct 58380 
tcaatttatt cagtggtcta cctgcagtgg ttctcaaata gtggtatgca taccaggctt 58440 
ccttcgagtg gcacacagac tatcgctgac taattaaagt aaaaaaatga acacaccttt 58500 
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caaccctttg atgcatatga taacacctga tgtgattagt aaattgatag gctaaacctt 58560 
ttattttaat ttttatttta ttatgtttgt tatgtattct atcatctgaa gctaatcttc 58620 
tcccaatatt tgtaacgctt gttgggggcg tatccgtttt tttaatatcc cattgttgat 58680 
tggtatgatt taatgaactg cttttctgac acacaaagct tactctaaca cgcagtcagc 58740 
agate taatt gaaaatttaa atatgeaaat catctatttc aaatatgttt agatttcaaa 58800 
gataaacaaa tgeatgeaac atatttcaaa aatgaaagat tttgaaaaaa agagtttata 58860 
tacgaatatg atgacatata gectatattt atgaggecca aggaacattt cctggtttta 58920 
atgaataggc ttttatatgt taatacttta catttaagta cagcagtttt cttttaactt 58980 
tttaagaaca gtgctgtttt aatctacttt taaaagcaca tttaactcaa acattattta 59040 
tttttttaca tgtgagcaca gtgcagtgta aatgttccaa cgatttaaaa tgttttaagt 59100 
ggctgacaat aataaatact cataatagaa attaatctgc ccgcttttga actgtgcaga 59160 
gttgtagctg cttaattagg cctgctacgc tactgtattt taatactgat cataatggtg 59220 
gtacttggag atagggctat gegattaate caaattgaat cgcaatcaca atttgaaaag 59280 
ttgtgattag ttaaatcgea caaggctgea atataaaata tatatgtata tatgtaaaca 59340 
aaaataataa ataacatttt caaaaacagt ctgetatget ttagaaaatt acacatgeta 59400 
gacattctgt gatagtgttg tgaggtatcg cagacatggt atcatttttc atgaactacg 59460 
ccacattaca caagaagaaa agactgaatc ttgtatcacg aagtccagtt gtcctctatg 59520 
gcttcccatg acaccctatg tcaaggaaat cgtgccgaaa tcatgtgatc tgacegggge 59580 
tttataacta gecaactgag tgagcaattt cattcagctc aatcagaaat gcgcaactga 59640 
acttcggcca caataaaaaa agaaaacaaa caggaaagcg ctaacaagtg gaattatagt 59700 
gtctgettet ccagaagcat taatagacaa attaatataa atgaaaaaca gtatttgtta 59760 
tatgggaata ttttggtttc aaagtcacag acgccaaaca aaaataggca atttttaaga 59820 
gctgttgcag aaatgtttac actgettaaa gactattaag ctaaaactaa aattaaatta 59880 
aaatcaactt gaagcttgta ttaaatttta aatcaagtcg caatatgtca aaaaataaat 59940 
aaaaaaataa atagcaacta gatatattgc ccaactggca cagccctact tggagagaca 60000 
attttttctg aggtgatact tgctgaaaaa agtttgagaa ccactgcatt caggcagaaa 60060 
cctgccatta aggcctccac acaatccctg attccctcaa tcacatccac agtagaagtt 6 0120 
tatatggaaa gggggttgaa actgattcaa atactgtcta aaatgaccta ctggaaaaac 60180 
aacagaaacc agagaaaaca cacattcaga cacaaatggt gggtttcaag atagtccaca 60240 
ccctgaaatt ctgaccgtct atttttatac ggcatagacc atcaattcat gctcactcca 60300 
gagttttcaa aaggatttca tatcctaaag gatttaactt tattcactga aacacagatt 60360 
tccaaactaa aagattgtac agtggcctgt ggcatggtaa acctgtcggt tccttacaat 60420 
gatggtaaat aaattcttat caacaaattc acctgaaaca gaaactgact agtgtcaaga 60480 
caaatgeaca tatactatag aactgaagag taaaaagagc aatcagtctc caacaggtct 60540 
gacaaaagat acttgtcata caagtttttc aggtgcatga caagaggaag cgtcatcacg 60600 
aagaaatagc tgactaaaaa tcagttatac ataaatgtgc tttattattt acatcaagta 60660 
aaaatatttc ataataacta cacctttttc caacaggtca cttttccccc tagcttacac 6072 0 
tgttatacag aatcttctaa atgtgacaga tagtgagtcg gctttgtgac agggaaagee 60780 
ctaagatgaa acaacacacc atataaacca ctgttatctt aacagtaaat cactgtgttt 60840 
atgtgaacaa ctataggtga agacgaaaga cgagatacag agagagtaat gaagaaacag 60900 
ctcatttgtt atggttggtt tcagttgaca acttcatcag agttttgttt tgagcttagc 60960 
taaegtaget tgctagctgg cttttcattg gtttctgaaa atttgeaccg ttgagaataa 61020 
gtgagaatgc aattgtatat ttaaacggtc tcactggcta cttaccctct gtagtaggct 61080 
tttacccgga cttggtgcgg gttttctccg gggtgggaca tggtgctgtc ccgcagcgtg 61140 
ggcattatgg actatcegta ctcctgctag cttagctaat gatgetagee agetategtc 612 00 
tgacgaggaa aaactacegg agttttaaaa etaaaegtat tctttggaca gcaatagtgc 61260 
cgtctccggt taaataaaca gccfcggctga aatataatta gtgaagttca gaeggaaatt 61320 
aateggacta ctagaagttc tgtcagaegg ggaaacactc ccttcttctg tcgcgcaggc 61380 
etcgagaegg actgatccaa acageggaat gtggaggggg aggatgagag ggggatttcc 61440 
agaagaaaaa tattttacag taacagegat tgattgtcac acaacaacaa caaaaaaaac 61500 
ttgtgactgt agttttaaaa agggtacaaa tgctaggcat ctgeaatatt gctgatgata 61560 
gagtattatg gcaatatggt tcaatgggtc tagttgaatg ggagagtctg gtaacattcc 61620 
gattatacca cacagaaatt gaagagggtg gagaacagta gectttatag gaagtacatt 61680 
acactgettt ttgtgggtct atgaataatt gaatttggag ggacgegatt aatgaagaaa 61740 
ttaaegctte tgtttaattc ttaccttcta cttcccagag tgaaegtact aaaaaatcca 61800 
geegcaaaaa cttcatagcc ttttaatact eggacaaggg cagtttctta aactgtgtac 61860 
aatattaaca ggggcagttc ttaataatag agggeccata tacctagata aataaaaatt 61920 
gtccctgttt ttctttatta ggatgttttt acttttgaac ttgtaaatcg tcacatgttt 61980 
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ccacatcaga 
acatctcctt 
ggaaaacaga 
tgaaggacga 
ataaatctat 
tcaaataaat 
tgcgggtagg 
actgaaagtt 
cttgaggaaa 
ataatctttt 
tatttttttt 
gctaacatct 
ctccaaagta 
ttattattaa 
tttttgtttg 
gtaacatcat 
aaaattaata 
taattttgta 
gaaagctgaa 
gtgtctgatg 
agaaatgcaa 
taatgtgttt 
gatggcttct 
gcttctcgaa 
atctcacttc 
gcctgaaggg 
cctttttcac 
tcgtcataaa 
cttcaatgac 
aacagatata 
atcagccaat 
tctgctgcag 
atgtggcaaa 
ggatttacag 
ttctgtgggc 
ccagctgata 
ggagagcatc 
aagaccacac 
ccaaaattgg 
gctgaggcat 
atcctgcctt 
ttggcacact 
gaatactgtt 
cttccggcag 
tttattaatc 
tgcagcggat 
actcatacac 
gtgagggaaa 
tcatttgcaa 
tttcgtgtca 
ccacgaagaa 
gtacagtgag 
ttttactaca 
gtaaattagg 
tggtctgtag 
ttttacccag 
aaaaatcctt 
aggagggcta 



aaaaagcctt 
taaccaataa 
aactaaatga 
aacctgcaaa 
aagtccctgc 
gaatattaaa 
caaattatta 
tatttaccct 
agtaaataca 
actttttccc 
ttataattta 
attaagtctc 
caacaaaaac 
tattattatt 
atgtttaaaa 
attaaaacat 
gttttatata 
catttttaat 
aaaactcctc 
gactgaaatc 
atgaatatgg 
tgtctctgct 
gaaggttgtc 
ggagcttcct 
atactaaaac 
acaagcagct 
acatttcatt 
caaacatgca 
actaagtttc 
ccgtacactc 
caaatggcaa 
tccacaccga 
gttgtgccag 
agggttcaca 
gcaaatgcct 
gaaaggcaac 
tctgaacaca 
cgggtgtcac 
acaatagaag 
tcggatggta 
gaatcaacgg 
ttaaacccat 
gctgaccatg 
gaaaaagcgg 
catggtcgcc 
gcccttccag 
tacagacaat 
ccggagcacc 
ctttgcaaca 
atatggacca 
ttaaggcagt 
ttttacaaaa 
tattttcaag 
ttaacagggc 
acaattgaaa 
aagaaacaaa 
cttctaataa 
atcattttta 



atatttaacc 
aacttcaaaa 
gacatatcag 
aacaataaat 
ataaataaaa 
atccacaaat 
aataaattac 
ctttcaaatg 
gaactatgtt 
ccaatatttg 
ttaacacaat 
ttcttcatta 
agtcctatta 
attattatgt 
atatcgaaag 
tgaatagctg 
ccaaggatac 
tctggcaaat 
cctgtaagtc 
ccacctgaaa 
tattagcaag 
gtgcagtctg 
tgtcgtttca 
ccacaatctg 
atgctggatg 
tgttattcat 
tctgaagcct 
agtcttttac 
aaatgtggat 
actggccact 
caactcaatc 
gcatcagaat 
acggcctggt 
aagaacgatc 
tgttgatgcc 
agtaactcaa 
caacacgtct 
tcctgtcacc 
attgggaaaa 
gggtcagatt 
tttaggctgg 
tagtaccaat 
tccatccctt 
catgtcattc 
acagtggaat 
tggcaaccca 
tcagcctatc 
cggtggaaac 
tttgcagccg 
aaatctctga 
tttgaaagca 
taatatgggc 
atactattca 
atgttaaggt 
aaaaacatag 
taaagtaaaa 
acatcatttg 
cttcatatta 



aataaagcaa 
tatttgccat 
cacacattat 
aaatacgcca 
taaataagta 
tcctgcata.a 
acaaaagttg 
Catattcatc 
caagaaatga 
tgtaatttta 
aatttctata 
agtctcttct 
ttattattat 
ttatcaatgt 
accagcattt 
aaagatagtt 
tatgaacaaa 
gcagttcttt 
tgtccgcttt 
gaagcagagg 
ctatcaaaaa 
ttgtccagcg 
agatggccaa 
caataacaaa 
atggaaacag 
tttaaactca 
gtaactgcaa 
aagtaaatta 
caaaaaatat 
agtccaactg 
catttaggca 
gggaaagaaa 
ctgagtattt 
agaaaaagag 
aaaggacaga 
at'aacccctc 
aacctagagc 
taagagcagg 
tgttgcctgg 
ttgctctcaa 
tggtggtgta 
tgagcatcat 
tatgaccaca 
atttattctt 
gaaccgccaa 
tctcagggaa 
caattcaccc 
ccacacaaac 
acaaatctgc 
ggaatatttc 
aaagcgcgtc 
ataactttgc 
gcttaaagtg 
aattaggcaa 
ctttagaaaa 
taaaagtaaa 
ggacaaattt 
caatgaatgc 



aatcttagca 
ttgctttata 
gctctcaagg 
atatataaat 
ataaataaat 
ataaatattt 
aggaattagg 
tattttcatt 
ataagcgttt 
tatacattta 
gggaaaacac 
gctcaccaaa 
tattattatt 
ttattaacat 
atctgagata 
tttttgtcaa 
agtgaggata 
tgaaatttct 
gcccaccagg 
aacaggatgc 
ataagcagct 
gagggttccg 
ccattctggc 
acaactgaat 
caggatattc 
tatctatatt 
gagatataaa 
taatttttta 
tccgtgggca 
ctcgttaagg 
tgtagacatg 
ggtgatttaa 
cagaaactgc 
aaatatccag 
ggagaatggc 
gtttacaact 
tggataggct 
aaactattcg 
tctcatgaat 
caagatgaaa 
atggtgtggg 
gtctatgcca 
gtgtactaat 
tttcttttcg 
cttatccagc 
acattcacac 
gtacctcatg 
agcctacaac 
agcaactgct 
cagtaccttg 
caacccggta 
catgatgaca 
ccaattaaag 
gtcattgtat 
ctaataatat 
taaagtaaaa 
gaaaaagaaa 
catgtagtgt 



atgttaaatg 

tagcctaatt 

cacgatggta 

aaattagtaa 

aatcaaataa 

aaatgaataa 

aaaatgctaa 

agcatgttga 

gaagggggaa 

tttatttgca 

tatctattaa 

cctgcattga 

attattatta 

ttaaattctt 

aaaatctttt 

agattattgt 

aagtcattta 

agtcattaaa 

atgtagatgt 

taaaatactt 

gtgatcattg 

gaaactgtgg 

tcaagctcaa 

ctacttctgg 

attattgtta 

ttattcttag 

agatttcaca 

tttaaaataa 

tatacgttat 

caaatattta 

gtgaagacga 

gtgactttaa 

tgatctactg 

agagcagcag 

cagactggtt 

gaggtatgca 

acagcagcag 

cacaggccca 

ctcgatttct 

gcaaggatcc 

ggataatttc 

cagcctacct 

gttctcccaa 

gctaagtccc 

ataagtttta 

actcattcac 

tctttggact 

agccgacaaa 

tttttgatgc 

taaatgtatg 

ctagtaaagt 

acacataata 

gcttaacaag 

gactgtggtt 

tgatcttttt 

tagtgtaaaa 

agaaatttac 

gtgaaacagg 
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ctcaatccaa gctaaccaaa agctttcgtt 
gactcacctg gtgctgcatc agagttgagt 
ggacttcagg tgttctagaa gctcagcagc 
gctctccact tcctgtctga ggaacttctt 
tactgataaa tggctcacga ataatcatgc 
tggagaaatt cagtgagaat gttttaattt 
ctattctaat ccagttcctg ggcgaggcag 
gcaagaagat cactgggtcg ctggttagaa 
ccctgccttt gtgtgggttt tcttccaggt 
gtggtacagg tgaattgggt aggctaaatt 
tgaatgtgtg tgtgaatgtt tcccagagat 
aaaaacttgc tgaataagtt ggcggttcat 
ggactaagct gacaagaaaa tgaatgaatg 
acagtaaatc tgactgcttg acactgaaaa 
taaaacacta gatgtcattc actttaaaca 
tgcacaataa tggaactgtc caactgttgc 
atgggcttaa tttaaattta ttaatttaaa 
attctctgct tttttttgca atttatttat 
aaaaattgtt acacgttaaa tcaattactt 
ttatacactt agtttaagag taggcactaa 
attttaaaat gtctggcctc tcccctcact 
tcaggtggtt ctcaaactgt ggtaacgcgg 
aatgtgtcat atgtacatgc tacatatatt 
atcatttatt ttattacata ttgcctatat 
tgatgaattg gctactatat gttaatactt 
tttttaaaaa cattaaccta acatttttaa 
gatgttctta gtttttatgt ttgcttgttt 
cagtgttaac attcagacaa ttcataatgt 
aataatagaa attaatctga tacgttttag 
gcttactacg ctactgtatt tcagtactgc 
ttttctgagg tgttacgtta tgaaaaaagt 
ttatttaatt aaaactatga actcagagct 
gaaattcacg taaattaggt caaattgtct 
ctgagaatta aacatgacct taataactcc 
tccgtgtttt aaatagtcca gacagcgctt 
acctaatctt ctcactcatt agacattact 
ttgaggatta ttttattttt ttaaaaatat 
caataaacgt tatttgaaat aaagagatag 
atacacactt ttgtgccatt taaatctgta 
tgtttacttg attaattctt tttacattta 
tggtaatttc tgagtcaatt ttataacatt 
aattgacttc aggataacta ataatagctt 
aggaccataa aaaataaaat tacaatttaa 
acacacacac acacacacac atatatataa 
tgtgtgtgtg tataaattgt aattttataa 
atatatatgt gtgtgtgtgt gtgtaaattg 
tattttttaa ttatatatat atatatatat 
atatatatat atatatatat atatatatat 
acacacacac acacannnnn rumnnnimnn 
nnnnnnimnn niumiinnnnn nimnnnniiriii 
atatatatat atatatatat atatatatat 
atatatatat atatatatat atatatatat 
atatatatat atatatatat atatatatat 
tatatgtata tatatatata tatatatata 
tatatatata tatatatata tatatatata 
tttttttttt tttttttttt tttttttttt 
gtttgtcttg gtcgggcatc tcagttgaga 
ctgtgaagta ccagtggcgt agctctcagc 



tgtcaaaagc acagagtgac agtgaatgag 6552 0 
tgagtgcagg tgagcagaga ttcagtatct 65580 
agcagcagca gcagggcttc ccaaagtact 65640 
cctgtgcaca agagcatttg tgttgctgta 65700 
agcaggaaga tgactttggg catctacatc 65760 
cattagagag gagtctgatg tgcatcaaat 65820 
tggcgcagta ggtagtgctg tcgcctcaca 658 80 
cctcggctca gttggtgttt ctgtgtttct 65940 
gctctggttt cccccacagt ccaaagacat 66000 
gtccatagtg gatgagtgcg tgtgtgtgtg 66060 
gggttgcggc tggaagggca tctgctgcct 6612 0 
tccgctgtgg cgacccccgg attaataaag 66180 
aataatccag ttccagctgc ttttaacaaa 66240 
ctggatttat aaaatgcatt gcaataatta 66300 
aactaggtca aatttaaagt ccccctctat 66360 
tgtccaactt attattatta ttttttttaa 6642 0 
tgggcctcat tgcaaagtaa atgattttag 66480 
ttatttattt aaaaatgagg attggatagt 66540 
tcacagagta taaatataag aaactagtta 66600 
ggcccaaccc tatttcacca aggtccactt 66660 
aatagtagtt atatagtatt aatctctagg 6672 0 
gcttcctgat agtggtacgc ggagatatca 66780 
tcaaaaatga tttatatatg aatatgacat 66840 
ttctgaggtc caggcaacat ttccaggtgt 66900 
tacatttaag tacaacactt ttcttactac 66960 
cttttaaaag cacattttaa tttaaatgtt 67020 
tttttgtttt ttttacatat aaacacagta 67080 
ttaaagtggc tgacaataat aaatattccc 67140 
aactgtgcag agctgtagct gctttactgg 67200 
tcattatggt ggtacttgga gagacaattt 67260 
ttgagaacca ctgctctagg tcaataatat 67320 
ttatcttatg ctcatggtta cgagattaca 67380 
tttttactta tttaaaaatg ttaaaaagaa 67440 
aaaagatata ctttatactt tgtgaatgaa 67500 
tacttctttg ttgaatgatt cagttgtttg 67560 
accacctact ggaagattta gattcttatt 6762 0 
atttttttca tattaaaatg caataaagta 67680 
ttcacacgca caaaattatt ttgtcatcat 67740 
acaatttctt gttattgtgt gttcagtaaa 67800 
actcaataat gacttgaaat cttctgttag 67860 
taacacagaa ttttcccaat agtttgcatt 67920 
taaagtggcc acctatcttt attttacccc 67980 
acacacacaa acacacacac acacacacac 68040 
agtgtgtgtg tgtgtgtgtg tgtgtgtgtg 68100 
ttaatttatt atgtatattt attttattat 68160 
taattttata attaatttat tatgtatata 68220 
atatatatat atatatatat atatatatat 68280 
atatatatat atacacacac acacacacac 6834 0 
nnnrmnniinn niuitmnririnn niinirnnimiin. 68400 
nnnnnnnimn imimnniinxin nimniuiiixinn 68460 
atatatatat atatatatat atatatatat 68520 
atatatatat atatntatat atatatatat 68580 
atatatatat atatatatat atatagtgta 68640 
tatatatata tatatatata tatatatata 68700 
tatatatata tatatatata tatatagana 68760 
ttcgcttttt ttcgcttttt taagaaggtg 68820 
cctgtccact atgggtgacc ctaccggtag 68880 
atcactgatg cacacaagcc ctcacagcac 68940 
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gtcaagctgc 
aactctaaca 
gaaaccgcct 
taccaacagc 
gtgaaatttg 
atacgtgccc 
cggggctcct 
gacccggatc 
gcgggccggt 
taccggcctg 
tcctccacac 
gagactgaag 
tccacggagg 
cttcttgaac 
aggagattag 
gtgtagattt 
tttaaacaat 
acagtctttg 
gcccatatac 
taaatcgtca 
aaattttagc 
cattatgctc 
acgcaaatat 
aaataaataa 
aattatttaa 
atgttggggg 
agcatgttga 
gaagggggaa 
atacacattt 
gaattattta 
ttcgtcctcc 
aatataatgt 
gaaaaaccaa 
gggatacatt 
agtctttata 
acctgtacct 
gaggaagaaa 
ggcaattata 
ggaggtgcat 
tgtccagcaa 
agaaaaagaa 
gtgactattt 
atagacttaa 
gcaagccccg 
agcgcttctt 
acctccagat 
ttatttacgt 
agttccaaaa 
taacctgaga 



aaaccgtgtg 
acaaccacta 
aaacgcatgt 
actgacaatc 
atgtattgta 
atgatcctcc 
tgggcagcgg 
tgacctccgt 
ccagcttctc 
cacgcttctt 
tccgaccgtg 
ctgcaggacg 
tgtcgcgctc 
gtcgcagagg 
ttcagagttt 
gtaaaccaac 
aattgatttc 
ttattaaata 
ctaaaaaaat 
tacgtttcca 
aatgttacat 
tcaaggcacg 
ataaacaaat 
atatgctaat 
ataataaata 
caataagaaa 
cttgaggaaa 
ataaactttt 
atttatttgc 
ctcatttatt 
atacgatggt 
aaatttcagg 
aactgaaaaa 
tttttctgtg 
tctgtgttct 
attcctgaag 
ttaatcaggg 
ttaatttcct 
tctccaggtt 
acaaacgttt 
atgtggtata 
ttaattttat 
ccattgtttg 
tcagaatcag 
gcctggtctg 
ccactcttca 
ttttgaggaa 
cactgcccta 
atgttctaag 



agggacecta 
atgtgccaaa 
caacctacag 
accaaaacac 
aacttattct 
atctggcact 
ctccttgctg 
gatgaacaca 
ccacagccgc 
cacagtggtg 
gatggagagg 
gagggcagga 
cagcaagaag 
catgttgaac 
agtagcgcgc 
tcaggttgcc 
agcgttaagt 
cagaaaatat 
aaatattgtc 
cgtcagaaaa 
gacatcttct 
atggtatgaa 
aagtaaataa 
aaataaataa 
gtgcgggcag 
atgctaaact 
agtaaatacg 
acttttcccc 
atatttattt 
tatatatttg 
gagctcattt 
atgttttcca 
cacaaaaacc 
tagctgtcag 
aggttgatgt 
gcagtgatag 
acagttcact 
acaccatgtt 
actacaagca 
actcacttca 
gtatccacag 
aaatcgatgt 
aattgtctaa 
taattttaaa 
agacccattt 
aaatcagctg 
ctaatttaat 
aaatagccac 
ctt 



acctacccta acccaacagc caacactaac 



aaccactcag 
caaagcatcc 
ccagcaatgt 
ataactattt 
aggatggctc 
atgtccacca 
cgtccatcag 
tgctttagct 
tccatcacac 
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<221> CDS 
<222> (205) 



. (1965) 



<400> 4 

ccgcggttcc ggctgctccg gcgaggcgac ccttgggtcg gcgctgcggg cgaggtgggc 60 
aggtaggtgg gcggacggcc gcggttctcc ggcaagcgca ggcggcggag tcccccacgg 120 
cgcccgaagc gccccccgca cccccggcct ccagcgttga ggcgggggag tgaggagatg 180 
ccgacccaga gggacagcag cacc atg tec cac acg gtc gca ggc ggc ggc 231 

Met Ser His Thr Val Ala Gly Gly Gly 
1 5 

age ggg gac cat tec cac cag gtc egg gtg aaa gec tac tac cgc ggg 279 
Ser Gly Asp His Ser His Gin Val Arg Val Lys Ala Tyr Tyr Arg Gly 
10 15 20 ' ' 25 

gat ate atg ata aca cat ttt gaa cct tec ate tec ttt gag ggc ctt 327 
Asp lie Met lie Thr His Phe Glu Pro Ser lie Ser Phe Glu Gly Leu 
30 35 40 

tgc aat gag gtt cga gac atg tgt tct ttt gac aac gaa cag etc ttc 375 
Cys Asn Glu Val Arg Asp Met Cys Ser Phe Asp Asn Glu Gin Leu Phe 
45 50 55 



acc atg aaa tgg ata gat gag gaa gga gac ccg tgt aca gta tea tct 
Thr Met Lys. Trp lie Asp Glu Glu Gly Asp Pro Cys Thr Val Ser Ser 
60 65 70 



423 



cag ttg gag tta gaa gaa gee ttt aga ctt tat gag eta aac aag gat 
Gin Leu Glu Leu Glu Glu Ala Phe Arg Leu Tyr Glu Leu Asn Lys Asp 
75 80 85 



471 



tct gaa etc ttg att cat gtg ttc cct tgt gta cca gaa cgt cct ggg 519 
Ser Glu Leu Leu lie His Val Phe Pro Cys Val Pro Glu Arg Pro Gly 
90 95 100 105 

atg cct tgt cca gga gaa gat aaa tec ate tac cgt aga ggt gca cgc 567 
Met Pro Cys Pro Gly Glu Asp Lys Ser He Tyr Arg Arg Gly Ala Arg 
110 115 120 

c gc tgg aga aag ctt tat tgt gee aat ggc cac act ttc caa gee aag 515 
Arg Trp Arg Lys Leu Tyr Cys Ala Asn Gly His Thr Phe Gin Ala Lys 
125 130 , 135 

cgt ttc aac agg cgt get cac tgt gee ate tgc aca gac cga ata tgg 663 
Arg Phe Asn Arg Arg Ala His Cys Ala He Cys Thr Asp Arg He Trp 
140 145 150 



gga ctt gga cgc caa gga tat aag tgc ate aac tgc aaa etc ttg gtt 
Gly Leu Gly Arg Gin Gly Tyr Lys Cys He Asn Cys Lys Leu Leu Val 
155 160 165 



711 



cat aag aag tgc cat aaa etc gtc aca att gaa tgt ggg egg cat tct 759 
His Lys Lys Cys His Lys Leu Val Thr He Glu Cys Gly Arg His Ser 
170 175 180 185 

ttg cca cag gaa cca gtg atg ccc atg gat cag tea tec atg cat tct 807 
Leu Pro Gin Glu Pro Val Met Pro Met Asp Gin Ser Ser Met His Ser 
190 195 200 
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gac cat gca cag aca gta att cca tat aat cct tea agt cat gag agt 855 
Asp His Ala Gin Thr Val lie Pro Tyr Asn Pro Ser Ser His Glu Ser 
205 210 215 

ttg gat caa gtt ggt gaa gaa aaa gag gca atg aac acc agg gaa agt 903 
Leu Asp Gin Val Gly Glu Glu Lys Glu Ala Met Asn Thr Arg Glu Ser 
220 225 230 

ggc aaa get tea tec agt eta ggt ctt cag gat ttt gat ttg etc egg 951 
Gly Lys Ala Ser Ser Ser Leu Gly Leu Gin Asp Phe Asp Leu Leu Arg 
235 240 245 

gta ata gga aga gga agt tat gee aaa gta ctg ttg gtt cga tta aaa 999 
Val lie Gly Arg Gly Ser Tyr Ala Lys Val Leu Leu Val Arg Leu Lys 
250 255 260 265 

aaa aca gat cgt att tat gca atg aaa gtt gtg aaa aaa gag ctt gtt 1047 
Lys Thr Asp Arg He Tyr Ala Met Lys Val Val Lys Lys Glu Leu Val 
270 275 ' 280 

aat gat gat gag gat att gat tgg gta cag aca gag aag eat gtg ttt 1095 
Asn Asp Asp Glu Asp He Asp Trp Val Gin Thr Glu Lys His Val Phe 
285 290 295 

gag cag gca tec aat cat cct ttc ctt gtt ggg ctg cat tct tgc ttt 1143 

Glu Gin Ala Ser Asn His Pro Phe Leu Val Gly Leu His Ser Cys Phe 

300 305 310 

i 

cag aca gaa age aga ttg ttc ttt gtt ata gag tat gta aat gga gga 1191 

Gin Thr Glu Ser Arg Leu Phe Phe Val He Glu Tyr Val Asn Gly Gly 

315 320 325 

gac eta atg ttt cat atg cag cga caa aga aaa ctt cct gaa gaa cat 1239 
Asp Leu Met Phe His Met Gin Arg Gin Arg Lys Leu Pro Glu Glu His 
330 335 340 ' 345 

gec aga ttt tac tct gca gaa ate agt eta gca tta aat tat ctt cat 1287 
Ala Arg Phe Tyr Ser Ala Glu He Ser Leu Ala Leu Asn Tyr Leu His 
350 355 360 

gag cga ggg ata att tat aga gat ttg aaa ctg gac aat gta tta ctg 1335 
Glu Arg Gly He He Tyr Arg Asp Leu Lys Leu Asp Asn Val Leu Leu 
365 370 375 

gac tct gaa ggc cac att aaa etc act gac tac ggc atg tgt aag gaa 13 83 
Asp Ser Glu Gly His lie Lys Leu Thr Asp Tyr Gly Met Cys Lys Glu 
380 385 390 

gga tta egg cca gga gat aca acc age act ttc tgt ggt act cct aat 1431 
Gly Leu Arg Pro Gly Asp Thr Thr Ser Thr Phe Cys Gly Thr Pro Asn 
395 400 405 

tac att get cct gaa att tta aga gga gaa gat tat ggt ttc agt gtt 1479 
Tyr He Ala Pro Glu He Leu Arg Gly Glu Asp Tyr Gly Phe Ser Val 
410 415 420 425 

gac tgg tgg get ctt gga gtg etc atg ttt gag atg atg gca gga agg 1527 
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Asp Trp Trp Ala Leu Gly Val Leu Met Phe Glu Met Met Ala Gly Arg 
430 435 440 

tct cca ttt gat att gtt ggg age tec gat aac cct gac cag aac aca 1575 
Ser Pro Phe Asp lie Val Gly Ser Ser Asp Asn Pro Asp Gin Asn Thr 
445 450 455 

gag gat tat etc ttc caa gtt att ttg gaa aaa caa att cgc ata cca 1623 
Glu Asp Tyr Leu Phe Gin Val lie Leu Glu Lys Gin lie Arg lie Pro 
460 465 470 

cgt tct ctg tct gta aaa get gca agt gtt ctg aag agt ttt ctt aat 1671 
Arg Ser Leu Ser Val Lys Ala Ala Ser Val Leu Lys Ser Phe Leu Asn 
475 480 485 

aag gac cct aag gaa cga ttg ggt tgt cat cct caa aca gga ttt get 1719 
Lys Asp Pro Lys Glu Arg Leu Gly Cys His Pro Gin Thr Gly Phe Ala 
490 495 500 505 

gat att cag gga cac ccg ttc ttc cga aat gtt gat tgg gat atg atg 1767 
Asp lie Gin Gly His Pro Phe Phe Arg Asn Val Asp Trp Asp Met Met 
510 515 520 

gag caa aaa cag gtg gta cct ccc ttt aaa cca aat att tct ggg gaa 1815 
Glu Gin Lys Gin Val Val Pro Pro Phe Lys Pro Asn lie Ser Gly Glu 
525 530 535 

ttt ggt ttg gac aac ttt gat tct cag ttt act aat gaa cct gtc cag 1863 
Phe Gly Leu Asp Asn Phe Asp Ser Gin Phe Thr Asn Glu Pro Val Gin 
540 545 550 

etc act cca gat gac gat gac att gtg agg aag att gat cag tct gaa 1911 
Leu Thr Pro Asp Asp Asp Asp lie Val Arg Lys He Asp Gin Ser Glu 
555 560 565 

ttt gaa ggt ttt gag tat ate aat cct ctt ttg atg tct gca gaa gaa 1959 
Phe Glu Gly Phe Glu Tyr He Asn Pro Leu Leu Met Ser Ala Glu Glu 
570 575 580 585 

tgt gtc tgatcctcat ttttcaacca tgtattctac tcatgttgcc atttaatgea 2015 
Cys Val 



tggataaact tgetgeaage ctggatacaa ttaaccattt tatatttgee acctacaaaa 2075 
aaaeacccaa tatcttctct tgtagactat atgaatcaat tattacatct gttttactat 2135 
gaaaaaaaaa ttaatactac tagcttccag acaatcatgt caaaatttag ttgaactggt 2195 
ttttcagttt ttaaaaggee tacagatgag taatgaagtt accttttttg tttaaaaaaa 2255 
aaaaag 2261 

<210> 5 

<211> 587 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Ser His Thr Val Ala Gly Gly Gly Ser Gly Asp His Ser His Gin 

15 10 15 

Val Arg Val Lys Ala Tyr Tyr Arg Gly Asp He Met He Thr His Phe 
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